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CROSS-REFERENCES TO RELATED APPLICATIONS 
[0001] The present application is related to U.S. Provisional Patent Application No. 
5 60/53 5,284, filed January 8, 2004, U. S . Provisional Patent Application No. 60/544,4 1 1 , filed 
February 12, 2004; U.S. Provisional Patent Application No. 60/546,63 1 , filed February 20, 
2004; U.S. Provisional Patent Application No. 60/555,813, filed March 23, 2004; U.S. 
Provisional Patent Application No. 60/570,891, filed May 12, 2004; each of which are 
incorporated herein by reference in their entirety for all purposes. 

1 0 Field of the Invention 

[0002] The present invention relates to O-linked glycosylated glycopeptides, particularly 
therapeutic peptides and peptide mutants that include O-linked glycosylation sites not present 
in the wild-type peptide. 

[0003] The administration of glycosylated and non-glycosylated peptides for engendering a 
1 5 particular physiological response is well known in the medicinal arts. For example, both 
purified and recombinant hGH are used for treating conditions and diseases due to hGH 
deficiency, e.g., dwarfism in children, interferon has known antiviral activity and granulocyte 
colony stimulating factor stimulates the production of white blood cells. 

[0004] A principal factor that has limited the use of therapeutic peptides is the difficulty 
20 inherent in engineering an expression system to express a peptide having the glycosylation 
pattern of the wild-type peptide. As is known in the art, improperly or incompletely 
glycosylated peptides can be immunogenic, leading to neutralization of the peptide and/or 
leading to the development of an allergic response. Other deficiencies of recombinantly 
produced glycopeptides include suboptimal potency and rapid clearance rates. 

25 [0005] One approach to solving the problems inherent in the production of glycosylated 
peptide therapeutics has been to modify the peptides in vitro after they are expressed. Post- 
expression in vitro modification has been used to both modify of glycan structures and 
introduce of glycans at novel sites. A comprehensive toolbox of recombinant eukaryotic 
glycosyltransferases has become available, making in vitro enzymatic synthesis of 

30 mammalian glycoconjugates with custom designed glycosylation patterns and glycosyl 
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structures possible. See, for example, U.S. Patent No. 5,876,980; 6,030,815; 5,728,554; 
5,922,577; and WO/9831826; US2003180835; and WO 03/031464. 

[0006] In addition to manipulating the structure of glycosyl groups on polypeptides, 
glycopeptides can be prepared with one or more non-saccharide modifying groups, such as 
5 water soluble polymers. An exemplary polymer that has been conjugated to peptides is 

poly(ethylene glycol) ("PEG"). The use of PEG to derivatize peptide therapeutics has been 
demonstrated to reduce the immunogenicity of the peptides. For example, U.S. Pat. No. 
4,179,337 (Davis et al.) discloses non-immunogenic polypeptides such as enzymes and 
peptide hormones coupled to polyethylene glycol (PEG) or polypropylene glycol. In addition 
10 to reduced immunogenicity, the clearance time in circulation is prolonged due to the 
increased size of the PEG-conjugate of the polypeptides in question. 

[0007] The principal mode of attachment of PEG, and its derivatives, to peptides is a non- 
specific bonding through a peptide amino acid residue (see e.g., U.S. Patent No. 4,088,538 
U.S. Patent No. 4,496,689, U.S. Patent No. 4,414,147, U.S. Patent No. 4,055,635, and PCT 
1 5 WO 87/00056). Another mode of attaching PEG to peptides is through the non-specific 
oxidation of glycosyl residues on a glycopeptide (see e.g., WO 94/05332). 

[0008] In these non-specific methods, poly(ethyleneglycol) is added in a random, non- 
specific maimer to reactive residues on a peptide backbone. Of course, random addition of 
PEG molecules has its drawbacks, including a lack of homogeneity of the final product, and 
20 the possibility for reduction in the biological or enzymatic activity of the peptide. Therefore, 
for the production of therapeutic peptides, a derivitization strategy that results in the 
formation of a specifically labeled, readily characterizable, essentially homogeneous product 
is superior. 

[0009] Specifically labeled, homogeneous peptide therapeutics can be produced in vitro 
25 through the action of enzymes. Unlike the typical non-specific methods for attaching a 

synthetic polymer or other label to a peptide, enzyme-based syntheses have the advantages of 
regioselectivity and stereoselectivity. Two principal classes of enzymes for use in the 
synthesis of labeled peptides are glycosyltransferases (e.g., sialyltransferases, 
oligosaccharyltransferases, N-acetylglucosaminyltransferases), and glycosidases. These 
30 enzymes can be used for the specific attachment of sugars which can be subsequently 

modified to comprise a therapeutic moiety. Alternatively, glycosyltransferases and modified 
glycosidases can be used to directly transfer modified sugars to a peptide backbone (see e.g., 

2 



WO 2005/070138 PCT/US2005/000799 

U.S. Patent 6,399,336, and U.S. Patent Application Publications 20030040037, 
20040132640, 20040137557, 20040126838, and 20040142856, each of which are 
incorporated by reference herein). Methods combining both chemical and enzymatic 
synthetic elements are also known {see e.g., Yamamoto et al. Carbohydr. Res. 305: 415-422 
5 (1998) and U.S. Patent Application Publication 20040137557 which is incorporated herein by 
reference). 

[0010] Carbohydrates are attached to glycopeptides in several ways of which N-linked to 
asparagine and mucin-type O-linked to serine and threonine are the most relevant for 
recombinant glycoprotein therapeuctics. Unfortunately, not all polypeptide comprise an N- 

10 or O-linked glycosylation site as part of their primary amino acid sequence. In other cases an 
existing glycosylation site may be inconvenient for the attachment of a modifying group (e.g., 
a water-soluble or water -insoluable polymers, therapeutic moieties, and or biomolecules) to 
the polypeptide, or attachment of such moieties at that site may cause an undesirable decrease 
in biological activity of the polypeptide. Thus there is a need in the art for methods that 

1 5 permit both the precise creation of glycosylation sites and the ability to precisely direct the 
modification of those sites. 

SUMMARY OF THE INVENTION 
[0011] It is a discovery of the present invention that enzymatic glycoconjugation reactions 
can be specifically targeted to O-linked glycosylation sites and to glycosyl residues that are 

20 attached to O-linked glycosylation sites. The targeted O-linked glycosylation sites can be 
sites native to a wild-type peptide or, alternatively, they can be introduced into a peptide by 
mutation. Accordingly, the present invention provides polypeptides comprising mutated sites 
suitable for O-linked glycosylation and pharmaceutical compositions thereof. In addition, the 
present invention provides methods of making such polypeptides and using such polypeptides 

25 and/or pharmaceutical compositions thereof for therapeutic treatments. 

[0012] Thus, in a first aspect, the invention provides an isolated polypeptide comprising a 
mutant peptide sequence, wherein the mutant peptide sequence encodes an O-linked 
glycosylation site that does not exist in the corresponding wild-type polypeptide. 
[0013] In one embodiment, the isolated polypeptide is a G-CSF polypeptide. 

30 [0014] In one embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
with the formula of M^TPLGP or M^PZ^nTPLGP. In this embodiment, the 
superscript, 1, denotes the first position of the amino acid sequence of the wild-type G-CSF 
sequence (SEQ ID NO:3), the subscripts n and m are integers selected from 0 to 3, and at 
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least one of X and B is threonine or serine, and when more than one of X and B is threonine 
or serine, the identity of these moieties is independently selected. Also in this embodiment, Z 
is selected from glutamate, any uncharged amino acid or dipeptide combination including 
MQ, GQ, and MV. In another embodiment, the G-CSF polypeptide comprises a mutant 
5 peptide sequence selected from the sequences consisting of MVTPLGP, MQTPLGP, 

MIATPLGP, MATPLGP, MPTQGAMPLGP , MVQTPLGP, MQSTPLGP, MGQTPLGP, 
MAPTSSSPLGP, and MAPTPLGPA. 

[0015] In another embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
with the formula of M ! TPXBO r P. In this embodiment the superscript, 1, denotes the first 

10 position of the amino acid sequence of the wild-type G-CSF sequence (SEQ ID NO:3), and 
the subscript r is an integer selected from 0 to 3, and at least one of X, B and O is threonine or 
serine, and when more than one of X, B and O is threonine or serine, the identity of these 
moieties is independently selected. In another embodiment, the G-CSF polypeptide 
comprises a mutant peptide sequence selected from the sequences consisting of: MTPTLGP, 

1 5 MTPTQLGP, MTPTSLGP, MTPTQGP, MTPTSSP, M^PQTP, M^PTGP, M X TPLTP, 
M^PNTGP, MTPLGP, M^PVTP, M^PMVTP, and MT 1 P 2 TQGL 3 G 4 P 5 A 6 S 7 . 
[0016] In another embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
with the formula of LGX 53 B 0 LGI, wherein the superscript denotes the position of the amino 
acid in the wild type G-CSF amino acid sequence, and X is histidine, serine, arginine, 

20 glutamic acid or tyrosine, and B is either threonine or serine, and o is an integer from 0 to 3. 
In another embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
selected from the sequences consisting of: LGHTLGI, LGSSLGI, LGYSLGI, LGESLGI, and 
LGSTLGI. 

[0017] In another embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
25 with the formula of P 129 Z m J q O r X n PT wherein the superscript denotes the position of the 

amino acid in the wild type G-CSF amino acid sequence, and Z, J, O and X are independently 
selected from threonine or serine, and m, q, r, and n are integers independently selected from 
0 to 3. In another embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
selected from the sequences consisting of: P 129 ATQPT, P 129 TLGPT, P I29 TQGPT, P 129 TSSPT, 
30 P 129 TQGAPT, P 129 NTGPT, PALQPTQT, P 129 ALTPT, P 129 MVTPT, P 129 ASSTPT, P 129 TTQP, 
P 129 NTLP, P 129 TLQP, MAP 1 29 ATQPTQ GAM, and MP 129 ATTQPTQGAM. 
[0018] In another embodiment, the G-CSF polypeptide comprises a mutant peptide sequence 
with the formula of PZ m U s J q P 61 O r X n B 0 C wherein the superscript denotes the position of the 
amino acid in the wild type G-CSF amino acid sequence, and at least one of Z, J, O, and U is 
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selected from threonine or serine, and when more than one of Z, J, O and U is threonine or 
serine, each is independently selected, X and B are any uncharged amino acid or glutamate, 
and m, s, q, r, n, and o are integers independently selected from 0 to 3. In another 
embodiment the G-CSF polypeptide comprises a mutant peptide sequence selected from the 
5 sequences consisting of: P 61 TSSC, P 61 TSSAC, LGIPTA P 61 LSSC, LGIPTQ P 61 LSSC, 
LGIPTQG P 61 LSSC, LGIPQT P 61 LSSC, LGIPTS P 61 LSSC 5 LGIPTS P 61 LSSC, 
LGIPTQP 61 LSSC, LGTPWAP 61 LSSC, LGTPFA P 61 LSSC, P 61 FTP, and SLGAP 58 TAP 61 LSS. 
[0019] In another embodiment the G-CSF polypeptide comprises a mutant peptide sequence 
with the formula of 0aGpJqO r P 175 XnB o Z m U s x Pt wherein the superscript denotes the position of 

10 the amino acid in SEQ ID NO:3, and at least one of Z, U, O, J, G, 0, B and X is threonine or 
serine and when more than one of Z, U, O, J, G, 0, B and X are threonine or serine, they are 
independently selected. 0 is optionally R, and G is optionally H. The symbol \P represents 
any uncharged amino acid residue or glutamate, and a, p, q, r, n, o, m, s, and t are integers 
independently selected from 0 to 3. In another embodiment the G-CSF polypeptide 

1 5 comprises a mutant peptide sequence selected from the sequences consisting of: 

RHLAQTP 175 , RHLAGQTP 175 , QP 175 TQGAMP, RHLAQTP 175 AM, QP 175 TSSAP, 
QP 175 TSSAP, QP 175 TQGAMP, QP 175 TQGAM, QP 175 TQGA, QP 175 TVM, QP 175 NTGP, and 
QP 175 QTLP. 

[0020] In another embodiment the G-CSF polypeptide comprises a mutant peptide sequence 
20 selected from the sequences P 133 TQTAMP 139 , P 133 TQGTMP, P 133 TQGTNP, P 133 TQGTLP, 
and PALQP 133 TQTAMPA. 

[0021] In another embodiment, the isolatedpolypepti.de is an hGH polypeptide. 
[0022] In one embodiment, the hGH polypeptide comprises a mutant peptide sequence with 
the formula of P 133 JXBOZUK 140 QTYS, wherein superscripts denote the position of the amino 
25 acid in (SEQ ID NO:20); and J is selected from threonine and arginine; X is selected from 
alanine, glutamine, isoleucine, and threonine; B is selected from glycine, alanine, leucine, 
valine, asparagine, glutamine, and threonine; O is selected from tyrosine, serine, alanine, and 
threonine; and Z is selected from isoleucine and methionine; and U is selected from 

i 

phenylalanine and proline. In another embodiment, the hGH polypeptide comprises a mutant 
30 peptide sequence selected from the sequences consisting of: PTTGQIFK, PTTAQIFK, 

PTTLQIFK, PTTLYVFK, PTTVQIFK, PTTVSIFKj PTTNQIFK, PTTQQIFK, PTATQIFK, 
PTQGQIFK, PTQGAIFK, PTQGAMFK, PTIGQIFK, PTINQIFK, PTINTIFK, PTILQIFK, 
PTIVQIFK, PTIQQIFK, PTIAQIFK, P 133 TTTQIFK 140 QTYS, and P 133 TQGAMPK 140 QTYS. 



5 



WO 2005/070138 PCT/US2005/000799 

[0023] In another embodiment, the hGH polypeptide comprises a mutant peptide sequence 
with the formula of P 133 RTGQIPTQBYS wherein superscripts denote the position of the 
amino acid in SEQ ID NO:20; and B is selected from alanine and threonine. In another 
embodiment, the hGH polypeptide comprises a mutant peptide sequence selected from the 
5 sequences consisting of: PRTGQIPTQTYS and PRTGQIPTQAYS . 

[0024] In another embodiment, the hGH polypeptide comprises a mutant peptide sequence 
with the formula of L 128 XTBOP 133 UTG wherein superscripts denote the position of the amino 
acid inSEQ ID NO:20; and X is selected from glutamic acid, valine and alanine; B is selcted 
from glutamine, glutamic acid, and glycine; O is selcted from serine and threonine; and U is 

10 selected from arginine, serine, alanine and leucine. In another embodiment, the hGH 

polypeptide comprises a mutant peptide sequence selected from the sequences consisting of: 
LETQSP 133 RTG, LETQSP 133 STG, LETQSP 133 ATG, LETQSP 133 LTG, LETETP 133 R, 
LETETP 133 A, LVTQSP 133 RTG, LVTETP I33 RTG, LVTETP 133 ATG, and L ATGSP 1 33 RTG. 
[0025] In another embodiment the hGH polypeptide comprises a mutant peptide sequence 

1 5 with the formula of M^PTXnZmOPLSRL wherein the superscript 1 , denotes the position of 
the amino acid in SEQ ID NO: 19; and B is selected from phenylalanine, valine and alanine or 
a combination thereof; X is selected from glutamate, valine and proline Z is threonine; O is 
selected from leucine and isoleucine; and when X is proline, Z is threonine; and wherein n 
and m are integers selected from 0 and 2. In another embodiment, the hGH polypeptide 

20 comprises a mutant peptide sequence selected from the sequences consisting of: M^FPTE 
IPLSRL, M^PTV LPLSRL, and M^PTPTIPLSRL. 

[0026] In still another embodiment the the hGH polypeptide comprises the following mutant 
peptide sequence: MVTPTIPLSRL. 

[0027] In still another embodiment the hGH polypeptide comprises a mutant peptide 
25 sequence selected from Jv^APTSSPTIPI^SR 9 and DGSP 133 NTGQIFK 140 . 

[0028] In another embodiment the isolated polypeptide is an IFN alpha polypeptide. 
[0029] In one embodiment, the INF alpha polypeptide has a peptide sequence comprising a 
mutant amino acid sequence, and the peptide sequence corresponds to a region of INF alpha 2 
having a sequence as shown in SEQ NO:22, and wherein the mutant amino acid sequence 

30 contains a mutation at a position corresponding to T 106 of INF alpha 2. In another 

i 

embodiment the IFN alpha polypeptide is selected from the group consisting of IFN alpha, 
IFN alpha 4, IFN alpha 5, IFN alpha 6, IFN alpha 7, IFN alpha 8, IFN alpha 10, IFN alpha 
14, IFN alpha 16, IFN alpha 17, and IFN alpha 21. In yet another embodiment, the IFN alpha 
polypeptide is an IFN alpha polypeptide comprising a mutant amino acid sequence selected 
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from the group consisting of: 99 C VMQEERVTETPLMNAD SIL 1 1 8 , 
"CVMQEEGVTETPLMNADSIL 118 , and "CVMQGVGVTETPLMNADSIL 118 . In still 
another embodiment, the IFN alpha polypeptide is an IFN alpha 4 polypeptide comprising a 
mutant amino acid sequence selected from the group consisting of: 
5 99 CVIQEVGVTETPLMNVDSIL 118 5 and "CVIQGVGVTETPLMKEDSIL 118 . In another 
embodiment, the IFN alpha polypeptide is an IFN alpha 5 polypeptide comprising a mutant 
amino acid sequence selected from the group consisting of: 

"CMMQEVGVTDTPLMNVDSIL 1 1S , "CMMQEVGVTETPLMNVDSIL 118 and 
"CMMQGVGVTDTPLMNVDSIL 118 . In an another embodiment, the IFN alpha 

1 0 polypeptide is an IFN alpha 6 polypeptide comprising a mutant amino acid sequence selected 
from the group consisting of: "CVMQEVWVTGTPLMNEDSIL 118 , 
"CVMQEVGVTGTPLMNEDSIL 118 , and "CVMQGVGVTETPLMNEDSIL 118 In yet an 
another embodiment, the IFN alpha polypeptide is an IFN alpha 7 polypeptide comprising a 
mutant amino acid sequence selected from the group consisting of: 

1 5 "CVIQEVGVTETPLMNEDFIL 118 , and "CVIQGVGVTETPLMNEDFIL 118 . In still another 
embodiment, the IFN alpha polypeptide is an IFN alpha 8 polypeptide comprising a mutant 
amino acid sequence selected from the group consisting of: 

"CVMQEVGVTESPLMYEDSIL 118 , and "CVMQGVGVTESPLMYEDSIL 118 . In another 
embodiment, the IFN alpha polypeptide is an IFN alpha 1 0 polypeptide comprising a mutant 
20 amino acid sequence selected from the group consisting of: 

"CVIQEVGVTETPLMNEDSIL 118 , and "CVIQGVGVTETPLMNEDSIL 118 . In another 
embodiment, the IFN alpha polypeptide is an IFN alpha 1 4 polypeptide comprising a mutant 
amino acid sequence selected from the group consisting of: 

"CVIQEVGVTETPLMNEDSIL 118 , and "CVIQGVGVTETPLMNEDSIL 118 . In another 
25 embodiment, the IFN alpha polypeptide is an IFN alpha 1 6 polypeptide comprising a mutant 
amino acid sequence selected from the group consisting of: 
"CVTQEVGVTEIPLMNEDSIL 1 1 8 , "CVTQEVGVTETPLMNEDSIL 118 , and 
"CVTQGVGVTETPLMNEDSIL 118 . In still another embodiment, the IFN alpha polypeptide 
is an IFN alpha 1 7 polypeptide comprising a mutant amino acid sequence selected from the 
3 0 group consisting of: "CVIQEVGMTETPLMNEDSIL 1 1 8 , 

"CVIQEVGVTETPLMNEDSIL 118 , and "CVIQGVGMTETPLMNEDSIL 118 . In one more 
embodiment, the IFN alpha polypeptide is an IFN alpha 21 polypeptide comprising a mutant 
amino acid sequence selected from the group consisting of: 
"CVIQEVGVTETPLMNVDSIL 118 , and "CVIQGVGVTETPLMNVDSIL 118 . 
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[0030] In a second aspect, the invention provides an isolated nucleic acid encoding a 
polypeptide comprising a mutant peptide sequence, wherein the mutant peptide sequence 
encodes an O-linked glycosylation site that does not exist in the corresponding wild-type 
polypeptide. In one embodiment the nucleic acid encoding a polypeptide comprising a 
5 mutant peptide sequence is comprised within an expression cassette. In another related 
embodiment, the present invention provides a cell comprises the nucleic acid of the present 
invention. 

[0031] In a third aspect, the isolated polypeptide comprising a mutant peptide sequence, that 

» 

encodes an O-linked glycosylation site that not existing in the corresponding wild-type 
10 polypeptide, has a formula selected from: 



AA— O— GalNAc — X ; and 



AA— O— GalNAc — X 



15 



20 



wherein AA is an amino acid side chain that comprises a hydroxyl moiety that is within the 
mutant polypeptide sequence; and X is a modifying group or a saccharyl moiety. In one 
embodiment X comprises a group selected from sialyl, galactosyl and Gal-Sia moieties, 
wherein at least one of said sialyl, galactosyl and Gal-Sia comprises a modifying group. 
[0032] In another embodiment X comprises the moiety: 



.OH 



COOH 




G— HN 



wherein D is a member selected from -OH and R -L-HN-;G is a member selected from R -L- 
and -C(0)(Ci-C6)alkyl; R 1 is a moiety comprising a member selected a moiety comprising a 
straight-chain or branched poly(ethylene glycol) residue; and L is a linker which is a member 
selected from a bond, substituted or unsubstituted alkyl and substituted or unsubstituted 
heteroalkyl, such that when D is OH, G is R*-L-, and when G is -C(0)(Ci-C6)alkyl, D is 
R X -L-NH-. 

[0033] In another embodiment X comprises the structure: 
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HOH 2 C 




in which L is a substituted or unsubstituted alkyl or substituted or unsubstituted heteroalkyl 
group; and n is selected from the integers from 0 to about 500. 
[0034] In another embodiment, X comprises the structure: 



5 

in which s is selected from the integers from 0 to 20. 

[0035] In a fourth aspect the invention provides a method for making a glycoconjugate of an 
isolated polypeptide comprising a mutant peptide sequence encoding an O-linked 
glycosylation site that does not existing in the corresponding wild-type polypeptide, 
1 0 comprising the steps of: 

(a) recombinantly producing the mutant polypeptide, and 

(b) enzymatically glycosylating the mutant polypeptide with a modified sugar at said O- 
linked glycosylation site. 

[0036] In a fifth aspect the invention provides a pharmaceutical composition of an isolated 
1 5 polypeptide comprising a mutant peptide sequence, wherein the mutant peptide sequence 
encodes an O-linked glycosylation site that does not exist in the corresponding wild-type 
polypeptide. 

[0037] In one embodiment the pharmaceutical composition comprises an effective amount of 
a G-CSF polypeptide of the invention glycoconjugated with a modified sugar. In a related 
20 embodiment, the modified sugar is modified with a member selected from poly(ethylene 
glycol) and methoxy-poly (ethylene glycol) (m-PEG). 
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[0038] In another embodiment the pharmaceutical composition comprises an effective 
amount of an hGH polypeptide of the invention glycoconjugated with a modified sugar. In a 
related embodiment, the modified sugar is modified with a member selected from 
poly (ethylene glycol) and methoxy-poly (ethylene glycol) (m-PEG). 

5 [0039] In another embodiment the pharmaceutical composition comprises an effective 

amount of an granulocyte macrophage colony stimulating factor polypeptide of the invention 
glycoconjugated with a modified sugar. In a related embodiment, the modified sugar is 
modified with a member selected from poly (ethylene glycol) and methoxy-poly(ethylene 
glycol) (m-PEG). 

1 0 [0040] In another embodiment the pharmaceutical composition comprises an effective 

amount of an IFN alpha polypeptide of the invention glycoconjugated with a modified sugar. 
In a related embodiment, the modified sugar is modified with a member selected from 
poly (ethylene glycol) and methoxy-poly (ethylene glycol) (m-PEG). 

[0041] In a sixth aspect the invention provides a method of providing therapy to a subject 
15 in need of said therapy, wherein the method comprises, administering to said subject an 
effective amount a pharmaceutical composition of the invention. In one embodiment, the 
therapy provided is G-CSF therapy. In another embodiment the therapy provided is 
granulocyte macrophage colony stimulating factor therapy. In another embodiment the 
therapy provided is interferon alpha therapy. In still another embodiment the therapy 
20 provided is Growth Hormone therapy. 

[0042] Additional aspects, advantages and objects of the present invention will be apparent 
from the detailed description that follows. 



[0043] FIG. 1 is a plot of absorbance vs. GCSF concentration for unmodified G-CSF and 



BRIEF DESCRIPTION OF THE DRAWINGS 



25 



glyco-PEG-ylated analogues in a NSF-60 cell proliferation assay. 



[0044] 



FIG. 2 is a plot of counts per minute (CPM) vs. time for a rat pharmacokinetic 
study using radioiodinated G-CSF and glycol — PEG-lated derivatives thereof. 



[0045] 



FIG. 3 is a plot of |Lig/mL G-CSF in blood vs. time (h) for a rat pharmacokinetic 
study using radioiodinated G-CSF and glycol — PEG-lated derivatives thereof. 
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[0046] FIG. 4 is a plot showing the induction of white blood cells in mice using 
unmodified G-CSF and chemically- and glyco-PEG-ylated G-CSF. 

[0047] FIG. 5 is a plot of the results of an aggregation assay following radioiodination with 
the Bolton-Hunter reagent. 

5 [0048] FIG. 6 is a plot of the results of an accelerated stability study of glyco-PEG-ylated 

G-CSF. 

[0049] FIG. 7 is an expanded view of FIG. 6. 

[0050] FIG. 8 is a plot of the results of a rat IV pK Study using the Bolton Hunter 
radiolabeling process (precipitated plasma protein). 

1 0 [0051] FIG. 9 is a plot of the results of a rat IV pK Study using unlabeled G-CSF, 

chemically- and glyco-PEG-ylated G-CSF detected by ELISA. 

[0052] FIG. 10 showns representative sialyltransferases of use in the present invention. 



DETAILED DESCRIPTION OF THE INVENTION 



Abbreviations 



1 5 [0053] PEG, poly(ethyleneglycol); m-PEG, methoxy-poly(ethylene glycol); PPG, 
poly(propyleneglycol); m-PPG, methoxy-poly(propylene glycol); Fuc, fucosyl; Gal, 
galactosyl; GalNAc, N-acetylgalactosaminyl; Glc, glucosyl; GlcNAc, N-acetylglucosaminyl; 
Man, mannosyl; ManAc, mannosaminyl acetate; Sia, sialic acid; and NeuAc, N- 
acetylneur aminy 1 . 

20 Definitions 

[0054] Unless defined otherwise, all technical and scientific terms used herein generally have 
the same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Generally, the nomenclature used herein and the laboratory procedures in 
cell culture, molecular genetics, organic chemistry and nucleic acid chemistry and 

25 hybridization are those well known and commonly employed in the art. Standard techniques 
are used for nucleic acid and peptide synthesis. The techniques and procedures are generally 
performed according to conventional methods in the art and various general references {see 
generally, Sambrook et ah Molecular Cloning: A Laboratory Manual, 2d ed. (1989) 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., which is incorporated 

30 herein by reference), which are provided throughout this document. The nomenclature used 
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herein and the laboratory procedures in analytical chemistry, and organic synthetic described 
below are those well known and commonly employed in the art. Standard techniques, or 
modifications thereof, are used for chemical syntheses and chemical analyses. 
[0055] All oligosaccharides described herein are described with the name or abbreviation for 
5 the non-reducing saccharide (7. e. , Gal), followed by the configuration of the glycosidic bond 
(a or P), the ring bond (1 or 2), the ring position of the reducing saccharide involved in the 
bond (2, 3, 4, 6 or 8), and then the name or abbreviation of the reducing saccharide (i.e., 
GlcNAc). Each saccharide is preferably a pyranose. For a review of standard glycobiology 
nomenclature see, Essentials of Glycobiology Varki et ah eds. CSHL Press (1999). 
10 [0056] Oligosaccharides are considered to have a reducing end and a non-reducing end, 

whether or not the saccharide at the reducing end is in fact a reducing sugar. In accordance 
with accepted nomenclature, oligosaccharides are depicted herein with the non-reducing end 
on the left and the reducing end on the right. 

[0057] The term "nucleic acid" or "polynucleotide" refers to deoxyribonucleic acids 
1 5 (DNA) or ribonucleic acids (RNA) and polymers thereof in either single- or double-stranded 
form. Unless specifically limited, the term encompasses nucleic acids containing known 
analogues of natural nucleotides that have similar binding properties as the reference nucleic 
acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless 
otherwise indicated, a particular nucleic acid sequence also implicitly encompasses 
20 conservatively modified variants thereof (e.g., degenerate codon substitutions), alleles, 

orthologs, SNPs, and complementary sequences as well as the sequence explicitly indicated. 
Specifically, degenerate codon substitutions may be achieved by generating sequences in 
which the third position of one or more selected (or all) codons is substituted with mixed- 
base and/or deoxyinosine residues (Batzer et ah, Nucleic Acid Res. 19:5081 (1991); Ohtsuka 
25 et ah, J. Biol. Chem. 260:2605-2608 (1985); and Rossolini et ah, Moh Cell. Probes 8:91-98 
(1994)). The term nucleic acid is used interchangeably with gene, cDNA, and mRNA 
encoded by a gene. 

[0058] The term "gene" means the segment of DNA involved in producing a polypeptide 
chain. It may include regions preceding and following the coding region (leader and trailer) 
30 as well as intervening sequences (introns) between individual coding segments (exons). 

[0059] The term "isolated," when applied to a nucleic acid or protein, denotes that the 
nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
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either a dry or aqueous solution. Purity and homogeneity are typically determined using 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 
performance liquid chromatography. A protein that is the predominant species present in a 
preparation is substantially purified. In particular, an isolated gene is separated from open 
5 reading frames that flank the gene and encode a protein other than the gene of interest. The 
term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an 
electrophoretic gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, 
more preferably at least 95% pure, and most preferably at least 99% pure. 

[0060] The term "amino acid" refers to naturally occurring and synthetic amino acids, as 
10 well as amino acid analogs and amino acid mimetics that function in a manner similar to the 
naturally occurring amino acids. Naturally occurring amino acids are those encoded by the 
genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, I e. , an a carbon that is 
15 bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 

norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified 
R groups {e.g., norleucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. "Amino acid mimetics" refers to chemical 
compounds having a structure that is different from the general chemical structure of an 
20 amino acid, but that functions in a manner similar to a naturally occurring amino acid. 

[0061] There are various known methods in the art that permit the incorporation of an 
unnatural amino acid derivative or analog into a polypeptide chain in a site-specific manner, 
see, e.g., WO 02/086075. 

i 

[0062] Amino acids may be referred to herein by either the commonly known three letter 
25 symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 

Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

[0063] "Conservatively modified variants" applies to both amino acid and nucleic acid 
sequences. With respect to particular nucleic acid sequences, "conservatively modified 
30 variants" refers to those nucleic acids that encode identical or essentially identical amino acid 
sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially 
identical sequences. Because of the degeneracy of the genetic code, a large number of 
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functionally identical nucleic acids encode any given protein. For instance, the codons GCA, 
GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an 
alanine is specified by a codon, the codon can be altered to any of the corresponding codons 
described without altering the encoded polypeptide. Such nucleic acid variations are "silent 
5 variations/ 5 which are one species of conservatively modified variations. Every nucleic acid 
sequence herein that encodes a polypeptide also describes every possible silent variation of 
the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, 
which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only 
codon for tryptophan) can be modified to yield a functionally identical molecule. 
1 0 Accordingly, each silent variation of a nucleic acid that encodes a polypeptide is implicit in 
each described sequence. 

[0064] As to amino acid sequences, one of skill will recognize that individual substitutions, 
deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 
alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded 
1 5 sequence is a "conservatively modified variant" where the alteration results in the substitution 
of an amino acid with a chemically similar amino acid. Conservative substitution tables 
providing functionally similar amino acids are well known in the art. Such conservatively 
modified variants are in addition to and do not exclude polymorphic variants, interspecies 
homologs, and alleles of the invention. 

20 [0065] The following eight groups each contain amino acids that are conservative 
substitutions for one another: 



1) 


Alanine (A), Glycine (G); 


2) 


Aspartic acid (D), Glutamic acid (E); 


3) 


Asparagine (N), Glutamine (Q); 


4) 


Arginine (R), Lysine (K); 


5) 


Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 


6) 


Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 


7) 


Serine (S), Threonine (T); and 


8) 


Cysteine (C), Methionine (M) 



30 (see, e.g., Creighton, Proteins (1984)). 

[0066] Amino acids may be referred to herein by either their commonly known three letter 
symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
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Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

[0067] In the present application, amino acid residues are numbered according to their 
relative positions from the most N-terminal residue, which is numbered 1 , in an unmodified 
5 wild-type polypeptide sequence. 

[0068] "Peptide" refers to a polymer in which the monomers are amino acids and are joined 
together through amide bonds. Peptides of the present invention can vary in size, e.g., from 
two amino acids to hundreds or thousands of amino acids, which alternatively is referred to as 
a polypeptide. Additionally, unnatural amino acids, for example, (3 -alanine, phenylglycine 

10 and homoarginine are also included. Amino acids that are not gene-encoded may also be 
used in the present invention. Furthermore, amino acids that have been modified to include 
reactive groups, glycosylation sites, polymers, therapeutic moieties, biomolecules and the like 
may also be used in the invention. All of the amino acids used in the present invention may 
be either the D - or L -isomer. The L -isomer is generally preferred. In addition, other 

1 5 peptidomimetics are also useful in the present invention. As used herein, "peptide" refers to 
both glycosylated and unglycosylated peptides. Also included are petides that are 
incompletely glycosylated by a system that expresses the peptide. For a general review, see, 

Spatola, A. F., in CHEMISTRY AND BIOCHEMISTRY OF AMINO ACIDS, PEPTIDES AND PROTEINS, 

B. Weinstein, eds., Marcel Dekker, New York, p. 267 (1983). 

20 [0069] In the present application, amino acid residues are numbered according to their 

relative positions from the N-terminal, e.g., the left most residue, which is numbered 1, in a 
peptide sequence. 

[0070] The term "mutant polypeptide" or "mutein" refers to a form of a peptide that differs 
from its corresponding wild-type form or naturally existing form. A mutant peptide can 
25 contain one or more mutations, e.g., replacement, insertion, deletion, etc. which result in the 
mutant peptide. 

[0071] The term "peptide conjugate," refers to species of the invention in which a peptide 
is glycoconjugated with a modified sugar as set forth herein. In a representative example, the 
peptide is a mutant peptide having an O-linked glycosylation site not present in the wild-type 
30 peptide. 

[0072] "Proximate a proline residue," as used herein refers to an amino acid that is less 
than about 1 0 amino acids removed from a proline residue, preferably, less than about 9, 8, 7, 
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6 or 5 amino acids removed from a proline residue, more preferably, less than about 4, 3, 2 or 
1 residues removed from a proline residue. The amino acid "proximate a proline residue" 
may be on the C- or N-terminal side of the proline residue. 

[0073] The term "sialic acid" refers to any member of a family of nine-carbon carboxylated 
5 sugars. The most common member of the sialic acid family is N-acetyl-neuraminic acid (2- 
keto-5-acetamido-3 ,5-dideoxy-D-glycero-D-galactononulopyranos- 1 -onic acid (often 
abbreviated as Neu5 Ac, NeuAc, or NANA). A second member of the family is N-glycolyl- 
neuraminic acid (Neu5Gc or NeuGc), in which the N-acetyl group of NeuAc is hydroxylated. 
A third sialic acid family member is 2-keto-3-deoxy-nonulosonic acid (KDN) (Nadano et ah 

10 (1986) J. Biol Chem. 261: 11550-11557; Kanamori et ah, J. Biol. Chem. 265: 21811-21819 
(1990)). Also included are 9-substituted sialic acids such as a 9-0-Ci-C6 acyl-Neu5Ac like 
9-O-lactyl-NeuSAc or 9-O-acetyl-NeuSAc, 9-deoxy-9-fluoro-Neu5Ac and 9-azido-9-deoxy- 
NeuSAc. For review of the sialic acid family, see, e.g., Varki, Glycobiology 2: 25-40 (1992); 
Sialic Acids: Chemistry, Metabolism and Function, R. Schauer, Ed. (Springer-Verlag, New 

15 York (1992)). The synthesis and use of sialic acid compounds in a sialylation procedure is 
disclosed in international application WO 92/16640, published October 1, 1992. 

[0074] As used herein, the term "modified sugar," refers to a naturally- or non-naturally- 
occurring carbohydrate that is enzymatically added onto an amino acid or a glycosyl residue 
of a peptide in a process of the invention. The modified sugar is selected from a number of 

20 enzyme substrates including, but not limited to sugar nucleotides (mono-, di-, and tri- 
phosphates), activated sugars (e.g., glycosyl halides, glycosyl mesylates) and sugars that are 
neither activated nor nucleotides. The "modified sugar" is covalently functionalized with a 
"modifying group." Useful modifying groups include, but are not limited to, water-soluble 
polymers, therapeutic moieties, diagnostic moieties, biomolecules and the like. The 

25 modifying group is preferably not a naturally occurring, or an unmodified carbohydrate. The 
locus of functionalization with the modifying group is selected such that it does not prevent 
the "modified sugar" from being added enzymatically to a peptide. 

« 

[0075] The term "water-soluble" refers to moieties that have some detectable degree of 
solubility in water. Methods to detect and/or quantify water solubility are well known in the 
30 art. Exemplary water-soluble polymers include peptides, saccharides, poly(ethers), 

poly(amines), poly(carboxylic acids) and the like. Peptides can have mixed sequences of be 
composed of a single amino acid, e.g., poly(lysine). An exemplary polysaccharide is 
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poly(sialic acid). An exemplary poly(ether) is poly(ethylene glycol), e.g., m-PEG. 
Poly(ethylene imine) is an exemplary polyamine, and poly(acrylic) acid is a representative 
poly(carboxylic acid). 

[0076] The polymer backbone of the water-soluble polymer can be poly(ethylene glycol) 
5 (i.e. PEG). However, it should be understood that other related polymers are also suitable for 
use in the practice of this invention and that the use of the term PEG or poly(ethylene glycol) 
is intended to be inclusive and not exclusive in this respect. The term PEG includes 
poly(ethylene glycol) in any of its forms, including alkoxy PEG, difunctional PEG, 
multiarmed PEG, forked PEG, branched PEG, pendent PEG (i.e. PEG or related polymers 
1 0 having one or more functional groups pendent to the polymer backbone), or PEG with 
degradable linkages therein. 

[0077] The polymer backbone can be linear or branched. Branched polymer backbones are 
generally known in the art. Typically, a branched polymer has a central branch core moiety 
and a plurality of linear polymer chains linked to the central branch core. PEG is commonly 

1 5 used in branched forms that can be prepared by addition of ethylene oxide to various polyols, 
such as glycerol, pentaerythritol and sorbitol. The central branch moiety can also be derived 
from several amino acids, such as lysine. The branched poly(ethylene glycol) can be 
represented in general form as R(-PEG-OH) m in which R represents the core moiety, such as 
glycerol or pentaerythritol, and m represents the number of arms. Multi-armed PEG 

20 molecules, such as those described in U.S. Pat. No. 5,932,462, which is incorporated by 
reference herein in its entirety, can also be used as the polymer backbone. 
[0078] Many other polymers are also suitable for the invention. Polymer backbones that 
are non-peptidic and water-soluble, with from 2 to about 300 termini, are particularly useful 
in the invention. Examples of suitable polymers include, but are not limited to, other 

25 poly(alkylene glycols), such as poly(propylene glycol) ("PPG"), copolymers of ethylene 

glycol and propylene glycol and the like, poly(oxyethylated polyol), poly(olefinic alcohol), 
poly(vinylpyrrolidone), poly(hydroxypropylmethacrylamide), poly(oc-hydroxy acid), 
poly(vinyl alcohol), polyphosphazene, polyoxazoline, poly(N-acryloylmorpholine), such as 
described in U.S. Pat. No. 5,629,384, which is incorporated by reference herein in its entirety, 

30 and copolymers, terpolymers, and mixtures thereof. Although the molecular weight of each 
chain of the polymer backbone can vary, it is typically in the range of from about 100 Da to 
about 100,000 Da, often from about 6,000 Da to about 80,000 Da. 
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[0079] The term "glycoconjugation," as used herein, refers to the enzymatically mediated 
conjugation of a modified sugar species to an amino acid or glycosyl residue of a 
polypeptide, e.g., a mutant human growth hormone of the present invention. A subgenus of 
"glycoconjugation" is "glycol-PEGylation," in which the modifying group of the modified 
5 sugar is poly (ethylene glycol), and alkyl derivative (e.g., m-PEG) or reactive derivative (e.g., 
H 2 N-PEG, HOOC-PEG) thereof. 

[0080] The terms "large-scale" and "industrial-scale" are used interchangeably and refer to 
a reaction cycle that produces at least about 250 mg, preferably at least about 500 mg, and 
more preferably at least about 1 gram of glycoconjugate at the completion of a single reaction 
10 cycle. 

[0081] The term, "glycosyl linking group," as used herein refers to a glycosyl residue to 
which a modifying group (e.g., PEG moiety, therapeutic moiety, biomolecule) is covalently 
attached; the glycosyl linking group joins the modifying group to the remainder of the 
conjugate. In the methods of the invention, the "glycosyl linking group" becomes covalently 

15 attached to a glycosylated or unglycosylated peptide, thereby linking the agent to an amino 
acid and/or glycosyl residue on the peptide. A "glycosyl linking group" is generally derived 
from a "modified sugar" by the enzymatic attachment of the "modified sugar" to an amino 
acid and/or glycosyl residue of the peptide. The glycosyl linking group can be a saccharide- 
derived structure that is degraded during formation of modifying group-modified sugar 

20 cassette (e.g., oxidation-»Schiff base formation-»reduction), or the glycosyl linking group 
may be intact. An "intact glycosyl linking group" refers to a linking group that is derived 
from a glycosyl moiety in which the saccharide monomer that links the modifying group and 
to the remainder of the conjugate is not degraded, e.g., oxidized, e.g., by sodium 
metaperiodate. "Intact glycosyl linking groups" of the invention may be derived from a 

25 naturally occurring oligosaccharide by addition of glycosyl unit(s) or removal of one or more 
glycosyl unit from a parent saccharide structure. 

[0082] The term "targeting moiety," as used herein, refers to species that will selectively 
localize in a particular tissue or region of the body. The localization is mediated by specific 
recognition of molecular determinants, molecular size of the targeting agent or conjugate, 
30 ionic interactions, hydrophobic interactions and the like. Other mechanisms of targeting an 
agent to a particular tissue or region are known to those of skill in the art. Exemplary 
targeting moieties include antibodies, antibody fragments, transferrin, HS-glycoprotein, 
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coagulation factors, serum proteins, p-glycoprotein, G-CSF, GM-CSF, M-CSF, EPO and the 
like. 

[0083] As used herein, "therapeutic moiety" means any agent useful for therapy including, 
but not limited to, antibiotics, anti-inflammatory agents, anti-tumor drugs, cytotoxins, and 
radioactive agents. "Therapeutic moiety" includes prodrugs of bioactive agents, constructs ir 
which more than one therapeutic moiety is bound to a carrier, e.g, multivalent agents. 
Therapeutic moiety also includes proteins and constructs that include proteins. Exemplary 
proteins include, but are not limited to, Erythropoietin (EPO), Granulocyte Colony 
Stimulating Factor (GCSF), Granulocyte Macrophage Colony Stimulating Factor (GMCSF), 
Interferon (e.g., Interferon-cc, -p, -y), Interleukin (e.g., Interleukin II), serum proteins (e.g., 
Factors VII, Vila, VIII, IX, and X), Human Chorionic Gonadotropin (HCG), Follicle 
Stimulating Hormone (FSH) and Lutenizing Hormone (LH) and antibody fusion proteins 
(e.g. Tumor Necrosis Factor Receptor ((TNFR)/Fc domain fusion protein)). 
[0084] As used herein, "anti-tumor drug" means any agent useful to combat cancer 
including, but not limited to, cytotoxins and agents such as antimetabolites, alkylating agents, 
anthracyclines, antibiotics, antimitotic agents, procarbazine, hydroxyurea, asparaginase, 
corticosteroids, interferons and radioactive agents. Also encompassed within the scope of the 
term "anti-tumor drug," are conjugates of peptides with anti-tumor activity, e.g. TNF-a. 
Conjugates include, but are not limited to those formed between a therapeutic protein and a 
glycoprotein of the invention. A representative conjugate is that formed between PSGL-1 
and TNF-a. 

[0085] As used herein, "a cytotoxin or cytotoxic agent" means any agent that is detrimental 
to cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, 
mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, 
daunorubicin, dihydroxy anthracinedione, mitoxantrone, mithramycin, actinomycin D, 1 - 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and 
puromycin and analogs or homologs thereof. Other toxins include, for example, ricin, CC- 
1065 and analogues, the duocarmycins. Still other toxins include diptheria toxin, and snake 
venom (e.g., cobra venom). 

[0086] As used herein, "a radioactive agent" includes any radioisotope that is effective in 
diagnosing or destroying a tumor. Examples include, but are not limited to, indium- 1 11, 
cobalt-60. Additionally, naturally occurring radioactive elements such as uranium, radium, 
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and thorium, which typically represent mixtures of radioisotopes, are suitable examples of a 
radioactive agent. The metal ions are typically chelated with an organic chelating moiety. 

[0087] Many useful chelating groups, crown ethers, cryptands and the like are known in the 
art and can be incorporated into the compounds of the invention (e.g., EDTA, DTP A, DOTA, 
5 NTA, HDTA, etc. and their phosphonate analogs such as DTPP, EDTP, HDTP, NTP, etc). 
See, for example, Pitt et al., "The Design of Chelating Agents for the Treatment of Iron 
Overload," In, Inorganic Chemistry in Biology and Medicine; Martell, Ed.; American 
Chemical Society, Washington, D.C., 1980, pp. 279-312; Lindoy, The Chemistry of 
Macrocyclic Ligand Complexes; Cambridge University Press, Cambridge, 1989; Dugas, 
10 BIOORGANIC Chemistry; Springer- Verlag, New York, 1989, and references contained 
therein. 

[0088] Additionally, a manifold of routes allowing the attachment of chelating agents, 
crown ethers and cyclodextrins to other molecules is available to those of skill in the art. See, 
for example, Meares et al, "Properties of In Vivo Chelate-Tagged Proteins and 
15 Polypeptides." In, Modification of Proteins: Food, Nutritional, and 

Pharmacological Aspects;" Feeney, et al, Eds., American Chemical Society, 
Washington, D.C., 1982, pp. 370-387; Kasina et al, Bioconjugate Chem., 9: 108-117 (1998); 
Song etal, Bioconjugate Chem., 8: 249-255 (1997). 

[0089] As used herein, "pharmaceutically acceptable carrier" includes any material, which 
20 when combined with the conjugate retains the conjugates' activity and is non-reactive with 
the subject's immune systems. Examples include, but are not limited to, any of the standard 
pharmaceutical carriers such as a phosphate buffered saline solution, water, emulsions such 
as oil/water emulsion, and various types of wetting agents. Other carriers may also include 
sterile solutions, tablets including coated tablets and capsules. Typically such carriers contain 
25 excipients such as starch, milk, sugar, certain types of clay, gelatin, stearic acid or salts 

thereof, magnesium or calcium stearate, talc, vegetable fats or oils, gums, glycols, or other 
known excipients. Such carriers may also include flavor and color additives or other 
ingredients. Compositions comprising such carriers are formulated by well known 
conventional methods. 

30 [0090] As used herein, "administering" means oral administration, administration as a 
suppository, topical contact, intravenous, intraperitoneal, intramuscular, intralesional, or 
subcutaneous administration, administration by inhalation, or the implantation of a slow- 
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release device, e.g., a, mini-osmotic pump, to the subject. Adminsitration is by any route 
including parenteral and transmucosal (e.g., oral, nasal, vaginal, rectal, or transdermal), 
particularly by inhalation. Parenteral administration includes, e.g., intravenous, 
intramuscular, intra-arteriole, intradermal, subcutaneous, intraperitoneal, intraventricular, and 
5 intracranial. Moreover, where injection is to treat a tumor, e.g., induce apoptosis, 

administration may be directly to the tumor and/or into tissues surrounding the tumor. Other 
modes of delivery include, but are not limited to, the use of liposomal formulations, 
intravenous infusion, transdermal patches, etc. 

[0091] The term "ameliorating" or "ameliorate" refers to any indicia of success in the 
10 treatment of a pathology or condition, including any objective or subjective parameter such as 
abatement, remission or diminishing of symptoms or an improvement in a patient's physical 
or mental well-being. Amelioration of symptoms can be based on objective or subjective 
parameters; including the results of a physical examination and/or a psychiatric evaluation. 

[0092] The term "therapy" refers to "treating" or "treatment" of a disease or condition 
1 5 including preventing the disease or condition from occurring in an animal that may be 

predisposed to the disease but does not yet experience or exhibit symptoms of the disease 
(prophylactic treatment), inhibiting the disease (slowing or arresting its development), 
providing relief from the symptoms or side-effects of the disease (including palliative 
treatment), and relieving the disease (causing regression of the disease). 

20 [0093] The term "effective amount" or "an amount effective to"or a "therapeutically 
effective amount" or any gramatically equivalent term means the amount that, when 
administered to an animal for treating a disease, is sufficient to effect treatment for that 
disease. 

[0094] The term "isolated" refers to a material that is substantially or essentially free from 
25 components, which are used to produce the material. For peptide conjugates of the invention, 
the term "isolated" refers to material that is substantially or essentially free from components, 
which normally accompany the material in the mixture used to prepare the peptide conjugate. 
"Isolated" and "pure" are used interchangeably. Typically, isolated peptide conjugates of the 
invention have a level of purity preferably expressed as a range. The lower end of the range 
30 of purity for the peptide conjugates is about 60%, about 70% or about 80% and the upper end 
of the range of purity is about 70%, about 80%, about 90% or more than about 90%. 
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[0095] When the peptide conjugates are more than about 90% pure, their purities are also 
preferably expressed as a range. The lower end of the range of purity is about 90%, about 
92%, about 94%, about 96% or about 98%. The upper end of the range of purity is about 
92%, about 94%, about 96%, about 98% or about 100% purity. 

5 [0096] Purity is determined by any art-recognized method of analysis (e.g., band intensity 
on a silver stained gel, polyacrylamide gel electrophoresis, HPLC, or a similar means). 

[0097] "Essentially each member of the population," as used herein, describes a 
characteristic of a population of peptide conjugates of the invention in which a selected 
percentage of the modified sugars added to a peptide are added to multiple, identical acceptor 
10 sites on the peptide. "Essentially each member of the population" speaks to the 

"homogeneity" of the sites on the peptide conjugated to a modified sugar and refers to 
conjugates of the invention, which are at least about 80%, preferably at least about 90% and 
more preferably at least about 95% homogenous. 

[0098] "Homogeneity," refers to the structural consistency across a population of acceptor 
15 moieties to which the modified sugars are conjugated. Thus, in a peptide conjugate of the 
' invention in which each modified sugar moiety is conjugated to an acceptor site having the 
same structure as the acceptor site to which every other modified sugar is conjugated, the 
peptide conjugate is said to be about 100% homogeneous. Homogeneity is typically 
expressed as a range. The lower end of the range of homogeneity for the peptide conjugates 
20 is about 60%, about 70% or about 80% and the upper end of the range of purity is about 70%, 
about 80%, about 90% or more than about 90%. 

[0099] When the peptide conjugates are more than or equal to about 90% homogeneous, 
their homogeneity is also preferably expressed as a range. The lower end of the range of 
homogeneity is about 90%, about 92%, about 94%, about 96% or about 98%. The upper end 
25 of the range of purity is about 92%, about 94%, about 96%, about 98% or about 100% 

homogeneity. The purity of the peptide conjugates is typically determined by one or more 
methods known to those of skill in the art, e.g., liquid chromatography-mass spectrometry 
(LC-MS), matrix assisted laser desorption mass time of flight spectrometry (MALDITOF), 
capillary electrophoresis, and the like. 

30 [0100] "Substantially uniform glycoform" or a "substantially uniform glycosylation 
pattern," when referring to a glycopeptide species, refers to the percentage of acceptor 
moieties that are glycosylated by the glycosyltransferase of interest (e.g., fucosyltransferase). 
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For example, in the case of a ccl,2 fucosyltransferase, a substantially uniform fucosylation 
pattern exists if substantially all (as defined below) of the Gaipi,4-GlcNAc-R and sialylated 
analogues thereof are fucosylated in a peptide conjugate of the invention. It will be 
understood by one of skill in the art, that the starting material may contain glycosylated 
5 acceptor moieties (e.g., fucosylated Gaipi,4-GlcNAc-R moieties). Thus, the calculated 

percent glycosylation will include acceptor moieties that are glycosylated by the methods of 
the invention, as well as those acceptor moieties already glycosylated in the starting material. 

[0101] The term "substantially" in the above definitions of "substantially uniform" 
generally means at least about 40%, at least about 70%, at least about 80%, or more 
10 preferably at least about 90%, and still more preferably at least about 95% of the acceptor 
moieties for a particular glycosyltransferase are glycosylated. 

[0102] Where substituent groups are specified by their conventional chemical formulae, 
written from left to right, they equally encompass the chemically identical substituents, which 
would result from writing the structure from right to left, e.g., ~CH 2 0- is intended to also 
15 recite -OCH2-. 

[0103] The term "alkyl," by itself or as part of another substituent means, unless otherwise 
stated, a straight or branched chain, or cyclic hydrocarbon radical, or combination thereof, 
which may be fully saturated, mono- or polyunsaturated and can include di~ and multivalent 
radicals, having the number of carbon atoms designated (i.e. C1-C10 means one to ten 

20 carbons). Examples of saturated hydrocarbon radicals include, but are not limited to, groups 
such as methyl, ethyl, n-propyl, isopropyl, n-butyl, t-butyl, isobutyl, sec-butyl, cyclohexyl, 
(cyclohexyl)methyl, cyclopropylmethyl, homologs and isomers of, for example, n-pentyl, n- 
hexyl, n-heptyl, n-octyl, and the like. An unsaturated alkyl group is one having one or more 
double bonds or triple bonds. Examples of unsaturated alkyl groups include, but are not 

25 limited to, vinyl, 2-propenyl, crotyl, 2-isopentenyl, 2-(butadienyl), 2,4-pentadienyl, 3 -(1,4- 
pentadienyl), ethynyl, 1- and 3-propynyl, 3-butynyl, and the higher homologs and isomers. 
The term "alkyl," unless otherwise noted, is also meant to include those derivatives of alkyl 
defined in more detail below, such as "heteroalkyl." Alkyl groups that are limited to 
hydrocarbon groups are termed "homoalkyl". 

30 [0104] The term "alkylene" by itself or as part of another substituent means a divalent radical 
derived from an alkane, as exemplified, but not limited, by -CH2CH2CH2CH2-, and further 
includes those groups described below as "heteroalkylene." Typically, an alkyl (or alkylene) 
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group will have from 1 to 24 carbon atoms, with those groups having 10 or fewer carbon 
atoms being preferred in the present invention. A "lower alkyl" or "lower alkylene" is a 
shorter chain alkyl or alkylene group, generally having eight or fewer carbon atoms. 
[0105] The terms "alkoxy," "alkylamino" and "alkylthio" (or thioalkoxy) are used in their 
5 conventional sense, and refer to those alkyl groups attached to the remainder of the molecule 
via an oxygen atom, an amino group, or a sulfur atom, respectively. 

[0106] The term "heteroalkyl," by itself or in combination with another term, means, unless 
otherwise stated, a stable straight or branched chain, or cyclic hydrocarbon radical, or 
combinations thereof, consisting of the stated number of carbon atoms and at least one 

10 heteroatom selected from the group consisting of O, N, Si and S, and wherein the nitrogen 
and sulfur atoms may optionally be oxidized and the nitrogen heteroatom may optionally be 
quaternized. The heteroatom(s) O, N and S and Si may be placed at any interior position of 
the heteroalkyl group or at the position at which the alkyl group is attached to the remainder 
of the molecule. Examples include, but are not limited to, -CH2-CH2-O-CH3, -CH 2 -CH 2 -NH- 

1 5 CH 3 , -CH 2 -CH 2 -N(CH 3 )-CH 3 , -CH 2 -S-CH 2 -CH 3 , -CH 2 -CH 2 ,-S(0)-CH 3 , ~CH 2 -CH 2 -S(0) 2 - 
CH 3 , -CH=CH-0-CH 3 , ~Si(CH 3 ) 3 , -CH 2 -CH=N-OCH 3 , and -CH=CH-N(CH 3 )-CH 3 . Up to 
two heteroatoms may be consecutive, such as, for example, ~CH 2 -NH-OCH 3 and -CH 2 -0~ 
Si(CH 3 ) 3 . Similarly, the term "heteroalkylene" by itself or as part of another substituent 
means a divalent radical derived from heteroalkyl, as exemplified, but not limited by, -CH2- 

20 CH2-S-CH2-CH2- and -CH 2 -S-CH 2 -CH 2 -NH-CH 2 -. For heteroalkylene groups, heteroatoms 
can also occupy either or both of the chain termini (e.g., alkyleneoxy, alkylenedioxy, 
alkyleneamino, alkylenediamino, and the like). Still further, for alkylene and heteroalkylene 
linking groups, no orientation of the linking group is implied by the direction in which the 
formula of the linking group is written. For example, the formula -C(0) 2 R'- represents both 

25 -C(0) 2 R'- and -R 5 C(0) 2 -. 

[0107] The terms "cycloalkyl" and "heterocycloalkyl", by themselves or in combination with 
other terms, represent, unless otherwise stated, cyclic versions of "alkyl" and "heteroalkyl", 
respectively. Additionally, for heterocycloalkyl, a heteroatom can occupy the position at 
which the heterocycle is attached to the remainder of the molecule. Examples of cycloalkyl 

30 include, but are not limited to, cyclopentyl, cyclohexyl, 1 -cyclohexenyl, 3-cyclohexenyl, 
cycloheptyl, and the like. Examples of heterocycloalkyl include, but are not limited to, 1 - 
(1,2,5,6-tetrahydropyridyl), 1 -piperidinyl, 2-piperidinyl, 3-piperidinyl, 4-morpholinyl, 3- 
morpholinyl, tetrahydrofuran-2-yl, tetrahydrofuraa-3-yl, tetrahydrothien-2-yl, 
tetrahydrothien-3-yl, 1 -piperazinyl, 2-piperazinyl, and the like. 
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[0108] The terms "halo" or "halogen/ 5 by themselves or as part of another substituent, mean, 
unless otherwise stated, a fluorine, chlorine, bromine, or iodine atom. Additionally, terms 
such as "haloalkyl," are meant to include monohaloalkyl and polyhaloalkyl. For example, the 
term "halo(Ci-C4)alkyl" is mean to include, but not be limited to, trifluoromethyl, 2,2,2- 
5 trifluoroethyl, 4-chlorobutyl, 3-bromopropyl, and the like. 

[0109] The term "aryl" means, unless otherwise stated, a polyunsaturated, aromatic, 
substituent that can be a single ring or multiple rings (preferably from 1 to 3 rings), which are 
fused together or linked covalently. The term "heteroaryl" refers to aryl groups (or rings) that 
contain from one to four heteroatoms selected from N, O, and S, wherein the nitrogen and 

1 0 sulfur atoms are optionally oxidized, and the nitrogen atom(s) are optionally quaternized. A 
heteroaryl group can be attached to the remainder of the molecule through a heteroatom. 
Non-limiting examples of aryl and heteroaryl groups include phenyl, 1 -naphthyl, 2-naphthyl, 
4-biphenyl, 1-pyrrolyl, 2-pyrrolyl, 3-pyrrolyl, 3-pyrazolyl, 2-imidazolyl, 4-imidazolyl, 
pyrazinyl, 2-oxazolyl, 4-oxazolyl, 2-phenyl-4-oxazolyl, 5-oxazolyl, 3-isoxazolyl, 4- 

15 isoxazolyl, 5-isoxazolyl, 2-thiazolyl, 4-thiazolyl, 5-thiazolyl, 2-furyl, 3-furyl, 2-thienyl, 3- 
thienyl, 2-pyridyl, 3-pyridyl, 4-pyridyl, 2-pyrimidyl, 4-pyrimidyl, 5-benzothiazolyl, purinyl, 
2-benzimidazolyl, 5-indolyl, 1 -isoquinolyl, 5-isoquinolyl, 2-quinoxalinyl, 5-quinoxalinyl, 3- 
quinolyl, tetrazolyl, benzo[b]furanyl, benzo[b]thienyl, 2,3 -dihydrobenzo [ 1 ,4] dioxin-6-yl, 
benzo[l,3]dioxol-5-yl and 6-quinolyl. Substituents for each of the above noted aryl and 

20 heteroaryl ring systems are selected from the group of acceptable substituents described 
below. 

[0110] For brevity, the term "aryl" when used in combination with other terms (e.g., aryloxy, 
arylthioxy, arylalkyl) includes both aryl and heteroaryl rings as defined above. Thus, the 
term "arylalkyl" is meant to include those radicals in which an aryl group is attached to an 
25 alkyl group (e.g., benzyl, phenethyl, pyridylmethyl and the like) including those alkyl groups 
in which a carbon atom (e.g., a methylene group) has been replaced by, for example, an 
oxygen atom (e.g., phenoxymethyl, 2-pyridyloxymethyl, 3-(l-naphthyloxy)propyl, and the 
like). 

[0111] Each of the above terms (e.g., "alkyl," "heteroalkyl," "aryl" and "heteroaryl") is 
30 meant to include both substituted and unsubstituted forms of the indicated radical. Preferred 
substituents for each type of radical are provided below. 

[0112] Substituents for the alkyl and heteroalkyl radicals (including those groups often 
referred to as alkylene, alkenyl, heteroalkylene, heteroalkenyl, alkynyl, cycloalkyl, 
heterocycloalkyl, cycloalkenyl, and heterocycloalkenyl) are generically referred to as "alkyl 
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group substituents," and they can be one or more of a variety of groups selected from, but not 
limited to: -OR\ =0, =NR', =N-OR', -NR'R", -SR', -halogen, -SiR'R"R"', -OC(0)R\ - 
C(0)R', ~C0 2 R', -CONR'R", -OC(0)NR'R", -NR"C(0)R\ -NR'-C(0)NR"R"', - 
NR"C(0) 2 R', -NR-C(NR'R"R'")==NR"", -NR-C(NR'R")=NR'", -S(0)R', -S(0) 2 R', - 
5 S(0) 2 NR'R", -NRS0 2 R', -CN and -N0 2 in a number ranging from zero to (2m'+l), where 
m' is the total number of carbon atoms in such radical. R', R", R"' and R"" each preferably 
independently refer to hydrogen, substituted or unsubstituted heteroalkyl, substituted or 
unsubstituted aryl, e.g., aryl substituted with 1-3 halogens, substituted or unsubstituted alkyl, 
alkoxy or thioalkoxy groups, or aryl alkyl groups. When a compound of the invention 

1 0 includes more than one R group, for example, each of the R groups is independently selected 
as are each R', R", R'" and R"" groups when more than one of these groups is present. When 
R' and R" are attached to the same nitrogen atom, they can be combined with the nitrogen 
atom to form a 5-, 6-, or 7-membered ring. For example, -NR'R" is meant to include, but not 
be limited to, 1 -pyrrolidinyl and 4-morpholinyl. From the above discussion of substituents, 

1 5 one of skill in the art will understand that the term "alkyl" is meant to include groups 

including carbon atoms bound to groups other than hydrogen groups, such as haloalkyl {e.g., 
-CF 3 and -CH 2 CF 3 ) and acyl {e.g., -C(0)CH 3 , -C(0)CF 3 , -C(0)CH 2 OCH 3 , and the like). 
[0113] Similar to the substituents described for the alkyl radical, substituents for the aryl and 
heteroaryl groups are generically referred to as "aryl group substituents." The substituents 

20 are selected from, for example: halogen, -OR', =0, =NR', =N-OR', -NR'R", -SR% -halogen, 
-SiR'R"R"', -OC(0)R', -C(0)R', -C0 2 R', -CONR'R", -OC(0)NR'R", -NR"C(0)R', 
-NR'-C(0)NR"R"', -NR"C(0) 2 R', -NR-C(NR'R"R'")=NR"", -NR-C(NR'R")=NR"', - 
S(0)R', -S(0) 2 R', -S(0) 2 NR'R", -NRS0 2 R', -CN and-N0 2 , -R', -N 3 , -CH(Ph) 2 , fluoro(C r 
C4)alkoxy, and fluoro(Ci-C 4 )alkyl, in a number ranging from zero to the total number of open 

25 valences on the aromatic ring system; and where R', R", R"' and R"" are preferably 

independently selected from hydrogen, substituted or unsubstituted alkyl, substituted or 
unsubstituted heteroalkyl, substituted or unsubstituted aryl and substituted or unsubstituted 
heteroaryl. When a compound of the invention includes more than one R group, for example, 
each of the R groups is independently selected as are each R', R", R'" and R"" groups when 

30 more than one of these groups is present. In the schemes that follow, the symbol X 
represents "R" as described above. 

[0114] Two of the substituents on adjacent atoms of the aryl or heteroaryl ring may 
optionally be replaced with a substituent of the formula -T-C(0)-(CRR') q -U-, wherein T and 
U are independently -NR-, -O-, -CRR'- or a single bond, and q is an integer of from 0 to 3. 

26 



WO 2005/070138 PCT/US2005/000799 

Alternatively, two of the substituents on adjacent atoms of the aryl or heteroaryl ring may 
optionally be replaced with a substituent of the formula — A-(CH2) r -B- 3 wherein A and B are 
independently -CRR'-, -O-, -NR-, -S-, -S(O)-, -S(0) 2 -, -S(0) 2 NR'- or a single bond, and r is 
an integer of from 1 to 4. One of the single bonds of the new ring so formed may optionally 
5 be replaced with a double bond. Alternatively, two of the substituents on adjacent atoms of 
the aryl or heteroaryl ring may optionally be replaced with a substituent of the formula — 
(CRR 5 ) s -X-(CR"R"')d-> where s and d are independently integers of from 0 to 3, and X is -O- 
, -NRX -S-, -S(O)-, -S(0) 2 -, or -S(0) 2 NR'-. The substituents R, R', R" and R'" are 
preferably independently selected from hydrogen or substituted or unsubstituted (Ci-C6)alkyl. 
10 [0115] As used herein, the term "heteroatom" is meant to include oxygen (O), nitrogen (N), 
sulfur (S) and silicon (Si). 

j 

Introduction 

[0116] The present invention provides conjugates of glycopeptides in which a modified 
sugar moiety is attached either directly or indirectly (e.g., through and intervening glycosyl 
15 residue) to an O-linked glycosylation site on the peptide. Also provided are methods for 
producing the conjugates of the invention. 

[0117] The O-linked glycosylation site is generally the hydroxy side chain of a natural 
(e.g., serine, threonine) or unnatural (e.g., 5-hydroxyproline or 5-hydroxylysine) amino acid. 
Exemplary O-linked saccharyl residues include N-acetylgalactosamine, galactose, mannose, 
20 GlcNAc, glucose, fucose or xylose. 

[0118] The methods of the invention can be practiced on any peptide having an O-linked 
glycosylation site. For example, the methods are of use to produce O-linked glycoconjugates 
in which the glycosyl moiety is attached to an O-linked glycosylation site that is present in 
the wild type peptide. Accordingly, the present invention provides glycoconjugates of wild- 
25 type peptides that include an O-linked glycosylation site. Exemplary peptides according to 
this description include G-CSF, GM-CSF, IL-2 and interferon. 

[0119] In exemplary embodiments the invention also provides novel mutant peptides that 
include one or more O-linked glycosylation sites that are not present in the corresponding 
wild-type peptide. In one embodiment the mutant polypeptide is a G-CSF polypeptide. In 
30 other exemplary embodiments the mutant polypeptide is an hGH polypeptide, an IFN alpha 
polypeptide or a GM-CSF polypeptide. Also provided are O-linked glycosylated versions of 
the mutant peptides, and methods of preparing O-linked glycosylated mutant peptides. 
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Additional methods include the elaboration, trimming back and/or modification of the O 
linked glycosyl residue and glycosyl residues that are N-, rather than O-linked. 

[0120] In an exemplary aspect, the invention provides a mutant peptide having the formula: 

vAAP \S\J\n 

I I 

AA— O— GalNAc — X ; and AA — O — GalNAc — X 



5 in which AA is an amino acid with a side chain that includes a hydroxyl moiety. Exemplary 
hydroxyamino acids are threonine and serine. The GalNAc moiety is linked to AA through 
the oxygen atom of the hydroxyl moiety. AA may be present in the wild type peptide or, 
alternatively, it is added or relocated by mutating the sequence of the wild type peptide. X is 
a modifying group, a saccharyl moiety, e.g., sialyl, galactosyl and Gal-Sia groups, or a 
10 saccharyl moiety and a modifying group. In an exemplary embodiment, in which X is a 
saccharyl moiety, it includes a modifying group, as discussed herein. The glycosylated 
amino acid can be at the N- or C -peptide terminus or internal to the peptide sequence. 

[0121] In an exemplary embodiment, X comprises a group selected from sialyl, galactosyl 
and Gal-Sia moieties, wherein at least one of said sialyl, galactosyl and Gal-Sia comprises a 
1 5 modifying group. In a further exemplary embodiment X comprises the moiety: 



COOH 




G— HN 



OH 



wherein D is a member selected from -OH and R^L-HN-jG is a member selected from R^L- 
and -C(0)(Ci-C6)alkyl; R 1 is a moiety comprising a member selected a moiety comprising a 
straight-chain or branched poly(ethylene glycol) residue; and L is a linker which is a member 
20 selected from a bond, substituted or unsubstituted alkyl and substituted or unsubstituted 
heteroalkyl, such that when D is OH, 0 is R^L-, and when G is -C(0)(Ci-C 6 )alkyl, D is 
R^L-NH-. 

[0122] In another exemplary embodiment X comprises the structure: 
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HOH 2 C 




'O 



n 



in which L is a substituted or unsubstituted alkyl or substituted or unsubstituted heteroalkyl 
group; and n is selected from the integers from 0 to about 2500. In yet another exemplary 
embodiment X comprises the structure: 



in which s is selected from the integers from 0 to 20, 

[0123] In another exemplary embodiment, AA is located within a proline-rich segment of 
the mutant peptide and/or it is proximate to a proline residue. Appropriate sequences forming 
O-linked glycosylation sites are readily determined by interrogating the enzymatic O-linked 
glycosylation of short peptides containing one or more putative O-linked glycosylation sites. 

[0124] The conjugates of the invention are formed between peptides and diverse species 
such as water-soluble polymers, therapeutic moieties, diagnostic moieties, targeting moieties 
and the like. Also provided are conjugates that include two or more peptides linked together 
through a linker arm, i.e., multifunctional conjugates; at least one peptide being O- 
glycosylated or including a mutant O-linked glycosylation site. The multi-functional 
conjugates of the invention can include two or more copies of the same peptide or a 
collection of diverse peptides with different structures, and/or properties. In exemplary 
conjugates according to this embodiment, the linker between the two peptides is attached to 
at least one of the peptides through an O-linked glycosyl residue, such as an O-linked 
glycosyl intact glycosyl linking group. 




n 



29 



WO 2005/070138 PCT/US2005/000799 

[0125] The conjugates of the invention are formed by the enzymatic attachment of a 
modified sugar to the glycosylated or unglycosylated peptide. The modified sugar is directly 
added to an O-linked glycosylation site, or to a glycosyl residue attached either directly or 
indirectly (e.g., through one or more glycosyl residue) to an O-linked glycosylation site. The 
5 invention also provides a conjugate of an O-linked glycosylated peptide in which a modified 
sugar is directly attached to an N-linked site, or to a glycosyl residue attached either directly 
or indirectly to an N-linked glycosylation site. 

[0126] The modified sugar, when interposed between the peptide (or glycosyl residue) and 
the modifying group on the sugar becomes what is referred to herein as "an intact glycosyl 

10 linking group." Using the exquisite selectivity of enzymes, such as glycosyltransferases, the 
present method provides peptides that bear a desired group at one or more specific locations. 
Thus, according to the present invention, a modified sugar is attached directly to a selected 
locus on the peptide chain or, alternatively, the modified sugar is appended onto a 
carbohydrate moiety of a glycopeptide. Peptides in which modified sugars are bound to both 

15 a glycopeptide carbohydrate and directly to an amino acid residue of the peptide backbone 
are also within the scope of the present invention. 

[0127] In contrast to known chemical and enzymatic peptide elaboration strategies, the 
methods of the invention, make it possible to assemble peptides and glycopeptides that have a 
substantially homogeneous derivatization pattern; the enzymes used in the invention are 

20 generally selective for a particular amino acid residue or combination of amino acid residues 
of the peptide. The methods are also practical for large-stale production of modified peptides 
and glycopeptides. Thus, the methods of the invention provide a practical means for large- 
scale preparation of glycopeptides having preselected uniform derivatization patterns. The 
methods are particularly well suited for modification of therapeutic peptides, including but 

25 not limited to, glycopeptides that are incompletely glycosylated during production in cell 
culture cells (e.g., mammalian cells, insect cells, plant cells, fungal cells, yeast cells, or 
prokaryotic cells) or transgenic plants or animals. / 

[0128] The methods of the invention also provide conjugates of glycosylated and 
unglycosylated peptides with increased therapeutic half-life due to, for example, reduced 
30 clearance rate, or reduced rate of uptake by the immune or reticuloendothelial system (RES). 
Moreover, the methods of the invention provide a means for masking antigenic determinants 
on peptides, thus reducing or eliminating a host immune response against the peptide. 
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Selective attachment of targeting agents to a peptide using an appropriate modified sugar can 
also be used to target a peptide to a particular tissue or cell surface receptor that is specific for 
the particular targeting agent. Moreover, there is provided a class of peptides that are 
specifically modified with a therapeutic moiety conjugated through a glycosyl linking group. 

5 O-Glycosylation 

[0129] The present invention provides O-linked glycosylated peptides, conjugates of these 
species and methods for forming O-linked glycosylated peptides that include a selected 
amino acid sequence ("an O-linked glycosylation site"). Of particular interest are mutant 
peptides that include an O-linked glycosylation site that is not present in the corresponding 
1 0 wild type peptide. The O-linked glycosylation site is a locus for attachment of a glycosyl 
residue that bears a modifying group. 

[0130] Mucin-type O-linked glycosylation, one of the most abundant forms of protein 
glycosylation, is found on secreted and cell surface associated glycoproteins of all eukaryotic 
cells. There is great diversity in the structures created by O-linked glycosylation (hundreds 

15 of potential structures), which are produced by the catalytic activity of hundreds of 

glycosyltransferase enzymes that are resident in the Golgi complex. Diversity exists at the 
level of the glycan structure and in positions of attachment of O-glycans to protein 
backbones. Despite the high degree of potential diversity, it is clear that O-linked 
glycosylation is a highly regulated process that shows a high degree of conservation among 

20 multicellular organisms. 

[0131] The first step in mucin-type O-linked glycosylation is catalysed by one or more 
members of a large family of UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferases 
(GalNAc-transferases) (EC 2.4.1.41), which transfer GalNAc to serine and threonine acceptor 
sites (Hassan et al., J. Biol Chem. 275: 38197-38205 (2000)). To date twelve members of 

25 the mammalian GalNAc-transferase family have been identified and characterized 

(Schwientek et al., J. Biol. Chem. 277: 22623-22638 (2002)), and several additional putative 
members of this gene family have been predicted from analysis of genome databases. The 
GalNAc-transferase isoforms have different kinetic properties and show differential 
expression patterns temporally and spatially, suggesting that they have distinct biological 

30 functions (Hassan et al., J. Biol Chem. 275: 38197-38205 (2000)). Sequence analysis of 
GalNAc-transferases have led to the hypothesis that these enzymes contain two distinct 
subunits: a central catalytic unit, and a C-terminal unit with sequence similarity to the plant 
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lectin ricin, designated the "lectin domain" (Hagen et al., J. Biol. Chem. 274: 6797-6803 
(1999); Hazes, Protein Eng. 10: 1353-1356 (1997); Breton et al., Curr. Opin. Sfruct. Biol 9: 
563-571 (1999)). Previous experiments involving site-specific mutagenesis of selected 
conserved residues confirmed that mutations in the catalytic domain eliminated catalytic 
5 activity. In contrast, mutations in the "lectin domain" had no significant effects on catalytic 
activity of the GalNAc-transferase isoform, GalNAc-Tl (Tenno et al,J. Biol Chem. 
277(49): 47088-96 (2002)). Thus, the C-terminal "lectin domain" was believed not to be 
functional and not to play roles for the enzymatic functions of GalNAc-transferases (Hagen et 
al., J. Biol. Chem. 274: 6797-6803 (1999)). 

10 [0132] However, recent evidence demonstrates that some GalNAc-transferases exhibit 

unique activities with partially GalNAc-glycosylated glycopeptides. The catalytic actions of 
at least three GalNAc-transferase isoforms, GalNAc-T4, -T7, and -T10, selectively act on 
glycopeptides corresponding to mucin tandem repeat domains where only some of the 
clustered potential glycosylation sites have been GalNAc glycosylated by other GalNAc- 

15 transferases (Bennett et al., FEBS Letters 460: 226-230 (1999); Ten Hagen et al., J. Biol 
Chem. 276: 17395-17404 (2001); Bennett et al., J. Biol Chem. 273: 30472-30481 (1998); 
Ten Hagen et al., J. Biol Chem. 274: 27867-27874 (1999)). GalNAc-T4 and -T7 recognize 
different GalNAc-glycosylated peptides and catalyse transfer of GalNAc to acceptor substrate 
sites in addition to those that were previously utilized. One of the functions of such GalNAc- 

20 transferase activities is predicted to represent a control step of the density of O-glycan 
occupancy in mucins and mucin-like glycoproteins with high density of O-linked 
glycosylation. 

[0133] One example of this is the glycosylation of the cancer-associated mucin MUC1 . 
MUC1 contains a tandem repeat O-linked glycosylated region of 20 residues 

25 (HGVTSAPDTRPAPGSTAPPA) with five potential O-linked glycosylation sites. 
GalNAc-Tl, -T2, and -T3 can initiate glycosylation of the MUC1 tandem repeat and 
incorporate at only three sites (HGVTSAPDTRPAPGSTAPPA, GalNAc attachment sites 
underlined). GalNAc-T4 is unique in that it is the only GalNAc-transferase isoform 
identified so far that can complete the O-linked glycan attachment to all five acceptor sites in 

30 the 20 amino acid tandem repeat sequence of the breast cancer associated mucin, MUC1. 
GalNAc-T4 transfers GalNAc to at least two sites not used by other GalNAc-transferase 
isoforms on the GalNAc 4 TAP24 glycopeptide (T APP AHGVTS APDTRP AP GSTAPP, 

unique GalNAc-T4 attachment sites are in bold) (Bennett et al., J. Biol Chem. 273: 30472- 
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30481 (1998). An activity such as that exhibited by GalNAc-T4 appears to be required for 
production of the glycoform of MUC1 expressed by cancer cells where all potential sites are 
glycosylated (Muller et al., J. Biol Chem. 274: 18165-18172 (1999)). Normal MUC1 from 
lactating mammary glands has approximately 2.6 O-linked glycans per repeat (Muller et al., 
5 J. Biol. Chem. 272: 24780-24793 (1997) and MUC1 derived from the cancer cell line T47D 
has 4.8 O-linked glycans per repeat (Muller et al., J. Biol Chem. 274: 18165-18172 (1999)). 
The cancer-associated form of MUC1 is therefore associated with higher density of O-linked 
glycan occupancy and this is accomplished by a GalNAc-transferase activity identical to or 
similar to that of GalNAc-T4. 

10 [0134] Polypeptide GalNAc-transferases, which have not displayed apparent GalNAc- 

glycopeptide specificities, also appear to be modulated by their putative lectin domains (PCT 
WO 01/85215 A2). Recently, it was found that mutations in the GalNAc-Tl putative lectin 
domain, similarly to those previously analysed in GalNAc-T4 (Hassan et al , J. Biol Chem. 
275: 38197-38205 (2000)), modified the activity of the enzyme in a similar fashion as 

15 GalNAc-T4. Thus, while wild type GalNAc-Tl added multiple consecutive GalNAc residues 
to a peptide substrate with multiple acceptor sites, mutated GalNAc-Tl failed to add more 
than one GalNAc residue to the same substrate (Tenno et al,J. Biol. Chem, 277(49): 47088- 
96 (2002)). 

[0135] Since it has been demonstrated that mutations of GalNAc transferases can be 
20 utilized to produce glycosylation patterns that are distinct from those produced by the wild- 
type enzymes, it is within the scope of the present invention to utilize one or more mutant 
GalNAc transferase in preparing the O-linked glycosylated peptides of the invention. 

Mutant Peptides with O-linked Glycosylation Sites 

[0136] The peptides provided by the present invention include an amino acid sequence that 
25 is recognized as a GalNAc acceptor by one or more wild-type or mutant GalNac transferases. 
The amino acid sequence of the peptide is either the wild-type, for those peptides that include 
an O-linked glycosylation site, a mutant sequence in which a non-naturally ocurring O-linked 
glycosylation site is introduced, or a polypeptide comprising both naturally occuring and non- 
naturally occuring O-linked glycosylation sites. Exemplary peptides with which the present 
30 invention is practiced include granulocyte colony stimulating factor (G-CSF), e.g., 175 and 
178 amino acid wild types (with or without N-terminal methionine residues), interferon {e.g., 
interferon alpha, e.g., interferon alpha 2b, or interferon alpha 2a), granulocyte macrophage 
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colony stimulating factor (GM-CSF), human growth hormone and interleukin {e.g., 
interleukin 2). The emphasis of the following discussion on G-CSF, GM-CSF and IFN-a 20 
is for clarity of illustration. Any number in the superscript of an amino acid indicates the 
amino acid position relative to the N-terminal methionine of the polypeptide. These numbers 
can be readily adjusted to reflect the absence of N-terminal methionine if the N-terminal of 
the polypeptide starts without a methionine. It is understood that the N-terminals of the 
exemplary peptides can start with or without a methionine. In addition, those of skill will 
understand that the strategy set forth herein for preparing O-linked glycoconjugated 
analogues of wild-type and mutant peptides is applicable to any peptide. , 

[0137] In an exemplary embodiment, the peptide is a biologically active G- 
CSF mutant that includes one or more mutation at a site selected from the N-terminus, 
adjacent to or encompassing H 53 , P 61 , P 129 , P 133 and P 175 . Biologically active G-CSF mutants 
of the present invention include any G-CSF polypeptide, in part or in whole, with one or 
more mutations that do not result in substantial or entire loss of its biological activity as it is 
measured by any suitable functional assays known to one skilled in the art. In one 
embodiment, mutations within the biologically active G-CSF mutants of the present invention 
are located within one or more O-linked glycosylation sites that do not naturally exist in wild 
type G-CSF. In another embodiment, mutations within the biologically active G-CSF 
mutants of the present invention reside within as well as outside of one or more O-linked 
glycosylation sites of the G-CSF mutants. 

[0138] Representative wild type and mutant G-CSF polypeptides have sequences that are 
selected from: 

SEQ. ID NO. 1 (178 amino acid wild type) 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklvseca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 2 (178 amino acid wild type without N-terminal 
methionine) 

tplgpasslp qsfllkcleq vrkiqgdgaa lqeklvseca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
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elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 3 (175 amino acid wild type) 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 4 (175 amino acid wild type without N-terminal 
methionine) 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 5 

mvtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 6 

mvtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghtlgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 7 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghtlgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 8 

mvtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllgsslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
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elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 9 

mqtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp; 

SEQ. ID NO. 10 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllghslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqptqgamp; and 

SEQ. ID NO. 11 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel 
vllgsslgip waplsscpsq alqlagclsq lhsglflyqg llqalegisp 
elgptldtlq ldvadfatti wqqmeelgma palqptqgam pafasafqrr 
aggvlvashl qsflevsyrv lrhlaqp 

SEQ ID NO: 12 

maitplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 

( 

ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg 
ptldtlqldv adfattiwqq meelgmapal qptqgampaf asafqrragg 
vlvashlqsf levsyrvlrh laqp 

SEQ ID NO: 13 

mgvtetplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk 
lchpeelvll ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq 
alegispelg ptldtlqldv adfattiwqq meelgmapal qptqgampaf 
asafqrragg vlvashlqsf levsyrvlrh laqp 

SEQ ID NO: 14 

maptplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg 
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ptldtlqldv adfattiwqq meelgmapal qptqgampaf asafqrragg 
vlvashlqsf levsyrvlrh laqp 

SEQ ID NO: 15 

Mtptqglgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg 
ptldtlqldv adfattiwqq meelgmapal qptqgampaf asafqrragg 
vlvashlqsf levsyrvlrh laqp 

SEQ ID NO:16 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg 
ptldtlqldv adfattiwqq meelgmapatqptqgampaf asafqrragg 
vlvashlqsf levsyrvlrh laqp 

SEQIDNO:17 

Mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipftp lsscpsqalq lagclsqlhs glflyqgllq alegispelg 
ptldtlqldv adfattiwqq meelgmapaL qptqgampaf asafqrragg 
vlvashlqsf levsyrvlrh laqp 

SEQ ID NO: 18 

mtplgpasslpqsfllkcleqvrkiqgdgaalqeklcatyklchpeelvllghslgi 

pwaplsscpsqalqlagclsqlhsglflyqgllqalegispelgptldtlqldvadfa 

ttiwqqmeelgmapalqptqtampafasafqrraggvlvashlqsflevsyrvlr 
hlaqp. 

[0139] In another exemplary embodiment, the peptide is a biologically active hGH mutant 
that includes one or more mutations at a site selected from the N-terminus or adjacent to or 
encompassing P 133 . Biologically active hGH mutants of the present invention include any 
hGH polypeptide, in part or in whole, with one or more mutations that do not result in 
substantial or entire loss of its biological activity as it is measured by any suitable functional 
assays known to one skilled in the art. In one embodiment, mutations within the biologically 
active hGH mutants of the present invention are located within one or more O-linked 
glycosylation sites that do not naturally exist in wild type hGH. In another embodiment, 
mutations within the biologically active hGH mutants of the present invention reside within 
as well as outside of one or more O-linked glycosylation sites of the hGH mutants. 
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[0140] Representative wild type and mutant hGH polypeptides have sequences that are 
selected from: 

SEQ ID NO: 19 (192 amino acid wild-type pituitary derived hGH 

* 

comprising an N-terminal methionine) 

mrptiplsrlfdnamlrahrlhqlafdtyqefeeayipkeqkysflqnpqtslcfse 
siptpsnreetqqksnlellrisllliqswlepvqflrsvfanslvygasdsnvydllk 
dleegiqtlmgrledgsprtgqifkqtyskfdtnshnddallknygllycfrkdm 
dkvetflrivqcrsvegscgf 

SEQ ID NO:20 (191 amino acid wild-type pituitary derived hGH 

lacking an N-Terminal methionine) 

fptiplsrlfdnamlrahrlhqlafdtyqefeeayipkeqkysflqnpqtslcfsesi 
plpsnreetqqksnlellrisllliqswlepvqflrsvfanslvygasdsnvydllkd 
leegiqtlmgrledgsprtgqifkqtyskfdtnshnddallknygllycfrkdmd 
kvetflrivqcrsvegscgf 

SEQ ID NO: 21 (wild type) 

MFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYI 

PKEQKYSFLQNPQTSLCFSESIPTPSNREETQQKSNLE 

LLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVY 

DLLKDLEEGIOTLMGR LEDGSPRTGOIFKQTYS KFDT 

NSHNDDALLKNYGLLYCFRKDMDKVETFLRTVQCR 

SVEGSCGF 

[0141] The following are representative mutant peptide sequences corresponding to the 
region underlined in the wild type SEQ ID NO:21 : LEDGSPTTGQIFKQTYS, 
LEDGSPTTAQIFKQTYS, LEDGSPTATQIFKQTYS, LEDGSPTQGAMFKQTYS, 
LEDGSPTQGAIFKQTYS, LEDGSPTQGQIFKQTYS, LED GSPTTL YVFKQTYS, 
LEDGSPTINTIFKQTYS, LEDGSPTTVSIFKQTYS, LEDGSPRTGQIPTQTYS, 
LEDGSPRTGQIPTQAYS, LEDGSPTTLQIFKQTYS, LETETPRTGQIFKQTYS, 
LVTETPRTGQIFKQTYS, LETQSPRTGQIFKQTYS, LVTQ SPRTGQIFKQT YS , 
LVTETP ATGQIFKQTYS , LEDGSPTQGAMFKQTYS, and LEDGSPTTTQIFKQT YS . 
[0142] In another exemplary embodiment, the peptide is a biologically active IFN alpha 
mutant that includes one or more mutations at a site corresponding to T of INF alpha 2, 
e.g., adjacent to or encompassing an amino acid position in IFN alpha wild type, which 
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corresponds to or aligns with T 106 of INF alpha 2. Biologically active IFN alpha mutants of 
the present invention include any IFN alpha polypeptide, in part or in whole, with one or 
more mutations that do not result in substantial or entire loss of its biological activity as it is 
measured by any suitable functional assays known to one skilled in the art. In one 
5 embodiment, mutations within the biologically active IFN alpha mutants of the present 
invention are located within one or more O-linked glycosylation sites that do not naturally 
exist in wild type IFN alpha. In another embodiment, mutations within the biologically 
active IFN alpha mutants of the present invention reside within as well as outside of one or 
more O-linked glycosylation sites of the IFN alpha mutants. 
1 0 [0143] A wild type and mutant IFN alpha polypeptide is shown below: 

SEQ ID NO:22 (from wild type IFN 2b) 

98 CVIQGVGVTETPLMKEDSIL 1 17 

[0144] Other appropriate O-linked glycosylation sequences for G-CSF and peptides other 
than G-CSF can be determined by preparing a polypeptide incorporating a putative O-linked 

15 glycosylation site and submitting that polypeptide to suitable O-linked glycosylation 

conditions, thereby confirming its ability to serve as an acceptor for a GalNac transferase. 
Moreover, as will be apparent to one of skill in the art, peptides that include one or more 
mutation are within the scope of the present invention. The mutations are designed to allow 
the adjustment of desirable properties of the peptides, e.g., activity and number and position 

20 of O- and/or N-linked glycosylation sites on the peptide. 

a 

Acquisition of Peptide Coding Sequences 
General Recombinant Technology 

[0145] This invention relies on routine techniques in the field of recombinant genetics. 
Basic texts disclosing the general methods of use in this invention include Sambrook and 
25 Russell, Molecular Cloning, A Laboratory Manual (3rd ed. 2001); Kriegler, Gene Transfer 
and Expression: A Laboratory Manual (1990); and Ausubel et al, eds., Current Protocols in 
Molecular Biology ( 1 994). 

[0146] For nucleic acids, sizes are given in either kilobases (kb) or base pairs (bp). These 
are estimates derived from agarose or acrylamide gel electrophoresis, from sequenced nucleic 
30 acids, or from published DNA sequences. For proteins, sizes are given in kilodaltons (kDa) 
or amino acid residue numbers. Proteins sizes are estimated from gel electrophoresis, from 
sequenced proteins, from derived amino acid sequences, or from published protein sequences. 
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[0147] Oligonucleotides that are not commercially available can be chemically synthesized, 
e.g., according to the solid phase phosphoramidite triester method first described by 
Beaucage & Caruthers, Tetrahedron Lett. 22: 1859-1862 (1981), using an automated 
synthesizer, as described in Van Devanter et. aL, Nucleic Acids Res. 12: 6159-6168 (1984). 
5 Entire genes can also be chemically synthesized. Purification of oligonucleotides is 

performed using any art-recognized strategy, e.g., native acrylamide gel electrophoresis or 
anion-exchange HPLC as described in Pearson & Reanier, J. Chrom. 255: 137-149 (1983). 

[0148] The sequence of the cloned wild-type peptide genes, polynucleotide encoding 
mutant peptides, and synthetic oligonucleotides can be verified after cloning using, e.g., the 
1 0 chain termination method for sequencing double-stranded templates of Wallace et ah , Gene 
16:21-26 (1981). 

Cloning and Subcloning of a Wild-Type Peptide Coding Sequence 
[0149] Numerous polynucleotide sequences encoding wild-type peptides have been 
determined and are available from a commercial supplier, e.g., human growth hormone, e.g., 
15 GenBank Accession Nos. NM 000515, NM 002059, NM 022556, NM 022557, NM 022558, 
NM 022559, NM 022560, NM 022561, and NM 022562. 

[0150] The rapid progress in the studies of human genome has made possible a cloning 
approach where a human DNA sequence database can be searched for any gene segment that 
has a certain percentage of sequence homology to a known nucleotide sequence, such as one 

20 encoding a previously identified peptide. Any DNA sequence so identified can be 

subsequently obtained by chemical synthesis and/or a polymerase chain reaction (PCR) 
technique such as overlap extension method. For a short sequence, completely de novo 
synthesis may be sufficient; whereas further isolation of full length coding sequence from a 
human cDNA or genomic library using a synthetic probe may be necessary to obtain a larger 

25 gene. 

[0151] Alternatively, a nucleic acid sequence encoding a peptide can be isolated from a 
human cDNA or genomic DNA library using standard cloning techniques such as polymerase 
chain reaction (PCR), where homology-based primers can often be derived from a known 
nucleic acid sequence encoding a peptide. Most commonly used techniques for this purpose 
30 are described in standard texts, e.g. , Sambrook and Russell, supra. 

[0152] cDNA libraries suitable for obtaining a coding sequence for a wild-type peptide 
may be commercially available or can be constructed. The general methods of isolating 
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mRNA, making cDNA by reverse transcription, ligating cDNA into a recombinant vector, 
transfecting into a recombinant host for propagation, screening, and cloning are well known 
{see, e.g., Gubler and Hoffman, Gene, 25: 263-269 (1983); Ausubel et al, supra). Upon 
obtaining an amplified segment of nucleotide sequence by PCR, the segment can be further 
5 used as a probe to isolate the full-length polynucleotide sequence encoding the wild-type 
peptide from the cDNA library. A general description of appropriate procedures can be 
found in Sambrook and Russell, supra. 1 

[0153] A similar procedure can be followed to obtain a full length sequence encoding a 
wild-type peptide, e.g., any one of the GenBank Accession Nos mentioned above, from a 

10 human genomic library. Human genomic libraries are commercially available or can be 

constructed according to various art-recognized methods. In general, to construct a genomic 
library, the DNA is first extracted from an tissue where a peptide is likely found. The DNA is 
then either mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb 
in length. The fragments are subsequently separated by gradient centrifugation from 

1 5 polynucleotide fragments of undesired sizes and are inserted in bacteriophage X vectors. These 
vectors and phages are packaged in vitro. Recombinant phages are analyzed by plaque 
hybridization as described in Benton and Davis, Science, 196: 180-182 (1977). Colony 
hybridization is carried out as described by Grunstein et al, Proc. Natl. Acad. Set. USA, 72: 
3961-3965 (1975). 

20 [0154] Based on sequence homology, degenerate oligonucleotides can be designed as 

primer sets and PCR can be performed under suitable conditions (see, e.g., White et al, PCR 
Protocols: Current Methods and Applications, 1993; Griffin and Griffin, PCR Technology, 
CRC Press Inc. 1994) to amplify a segment of nucleotide sequence from a cDNA or genomic 
library. Using the amplified segment as a probe, the full-length nucleic acid encoding a wild- 

25 type peptide is obtained. 

[0155] Upon acquiring a nucleic acid sequence encoding a wild-type peptide, the coding 
sequence can be subcloned into a vector, for instance, an expression vector, so that a 
recombinant wild-type peptide can be produced from the resulting construct. Further 
modifications to the wild-type peptide coding sequence, e.g., nucleotide substitutions, may be 
30 subsequently made to alter the characteristics of the molecule. 
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Introducing Mutations into a Peptide Sequence 

[0156] From an encoding polynucleotide sequence, the amino acid sequence of a wild-type 
peptide can be determined. Subsequently, this amino acid sequence may be modified to alter 
the protein's glycosylation pattern, by introducing additional glycosylation site(s) at various 
5 locations in the amino acid sequence. 

[0157] Several types of protein glycosylation sites are well known in the art. For instance, 
in eukaryotes, N-linked glycosylation occurs on the asparagine of the consensus sequence 
Asn-X aa -Ser/Thr, in which X aa is any amino acid except proline (Kornfeld et al., Ann Rev 
Biochem 54:631-664 (1985); Kukuruzinska et al , Proc. Natl Acad. Set USA 84:2145-2149 

10 (1987); Herscovics et al, FASEB .77:540-550 (1993); and Orlean, Saccharomyces Vol. 3 
(1996)). O-linked glycosylation takes place at serine or threonine residues (Tanner et al, 
Biochim. Biophys. Acta. 906:81-91 (1987); and Hounsell et al, Glycoconj. J. 13:19-26 
(1996)). Other glycosylation patterns are formed by linking glycosylphosphatidylinositol to 
the carboxyl-terminal carboxyl group of the protein (Takeda et aL, Trends Biochem. Set 

15 20:367-371 (1995); and Udenfriend etal,Ann. Rev. Biochem. 64:593-591 (1995). Based on 
this knowledge, suitable mutations can thus be introduced into a wild-type peptide sequence 
to form new glycosylation sites. 

[0158] Although direct modification of an amino acid residue within a peptide polypeptide 
sequence may be suitable to introduce a new N-linked or O-linked glycosylation site, more 
20 frequently, introduction of a new glycosylation site is accomplished by mutating the 

polynucleotide sequence encoding a peptide. This can be achieved by using any of known 
mutagenesis methods, some of which are discussed below. Exemplary modifications to a G- 
CSF peptide include those illustrated in SEQ ID NO:5-18. 

[0159] A variety of mutation- generating protocols are established and described in the art. 
25 See, e.g., Zhang et al., Proc. Natl Acad. Sci. USA, 94: 4504-4509 (1997); and Stemmer, 

r 

Nature, 370: 389-391 (1994). The procedures can be used separately or in combination to 
produce variants of a set of nucleic acids, and hence variants of encoded polypeptides. Kits 
for mutagenesis, library construction, and other diversity-generating methods are 
commercially available. 

30 [0160] Mutational methods of generating diversity include, for example, site-directed 
mutagenesis (Botstein and Shortle, Science, 229: 1193-1201 (1985)), mutagenesis using 
uracil-containing templates (Kunkel, Proc. Natl Acad. Sci. USA, 82: 488-492 (1985)), 
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oligonucleotide-directed mutagenesis (Zoller and Smith, Nucl. Acids Res., 10: 6487-6500 
(1982)), phosphorothioate-modified DNA mutagenesis (Taylor et hi, Nucl Acids Res., 13: 
8749-8764 and 8765-8787 (1985)), and mutagenesis using gapped duplex DNA (Kramer et 
id. 9 Nucl Acids Res., 12: 9441-9456 (1984)). 

5 [0161] Other methods for generating mutations include point mismatch repair (Kramer et 
al 9 Cell, 38: 879-887 (1984)), mutagenesis using repair-deficient host strains (Carter et al, 
Nucl Acids Res., 13: 4431-4443 (1985)), deletion mutagenesis (Eghtedarzadeh and Henikoff, 
Nucl Acids Res., 14: 51 15 (1986)), restriction-selection and restriction-purification (Wells et 
al,Phil Trans. R. Soc. Lond. A, 317: 415-423 (1986)), mutagenesis by total gene synthesis 
1 0 (Nambiar et al , Science, 223 : 1 299- 1301 (1984)), double-strand break repair (Mandecki, 
Proc. Natl Acad. Sci. USA, 83: 7177-7181 (1986)), mutagenesis by polynucleotide chain 
termination methods (U.S. Patent No. 5,965,408), and error-prone PGR (Leung et al, 
Biotechniques, 1: 11-15 (1989)). 

Modification of Nucleic Acids for Preferred Codon Usage in a Host Organism 
1 5 [0162] The polynucleotide sequence encoding a mutant peptide can be further altered to 
coincide with the preferred codon usage of a particular host. For example, the preferred 
codon usage of one strain of bacterial cells can be used to derive a polynucleotide that 
encodes a mutant peptide of the invention and includes the codons favored by this strain. The 
frequency of preferred codon usage exhibited by a host cell can be calculated by averaging 
20 frequency of preferred codon usage in a large number of genes expressed by the host cell 
{e.g., calculation service is available from web site of the Kazusa DNA Research Institute, 
Japan). This analysis is preferably limited to genes that are highly expressed by the host cell. 
U.S. Patent No. 5,824,864, for example, provides the frequency of codon usage by highly 
expressed genes exhibited by dicotyledonous plants and monocotyledonous plants. 

25 [0163] At the completion of modification, the mutant peptide coding sequences are verified 
by sequencing and are then subcloned into an appropriate expression vector for recombinant 
production in the same manner as the wild-type peptides. 

Expression and Purification of the Mutant Peptide 

[0164] Following sequence verification, the mutant peptide of the present invention can be 
30 produced using routine techniques in the field of recombinant genetics, relying on the 
polynucleotide sequences encoding the polypeptide disclosed herein. 
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Expression Systems 

[0165] To obtain high-level expression of a nucleic acid encoding a mutant peptide of the 
present invention, one typically subclones a polynucleotide encoding the mutant peptide into 
an expression vector that contains a strong promoter to direct transcription, a 
5 transcription/translation terminator and a ribosome binding site for translational initiation. 
Suitable bacterial promoters are well known in the art and described, e.g., in Sambrook and 
Russell, supra, and Ausubel et al, supra. Bacterial expression systems for expressing the 
wild-type or mutant peptide are available in, e.g., E. coli s Bacillus sp. } Salmonella, and 
Caulobacter. Kits for such expression systems are commercially available. Eukaryotic 
1 0 expression systems for mammalian cells, yeast, and insect cells are well known in the art and 
are also commercially available. In one embodiment, the eukaryotic expression vector is an 
adenoviral vector, an adeno-associated vector, or a retroviral vector. 

[0166] The promoter used to direct expression of a heterologous nucleic acid depends on 
the particular application. The promoter is optionally positioned about the same distance 
1 5 from the heterologous transcription start site as it is from the transcription start site in its 
natural setting. As is known in the art, however, some variation in this distance can be 
accommodated without loss of promoter function. 

[0167] In addition to the promoter, the expression vector typically includes a transcription 
unit or expression cassette that contains all the additional elements required for the 

20 expression of the mutant peptide in host cells. A typical expression cassette thus contains a 
promoter operably linked to the nucleic acid sequence encoding the mutant peptide and 
signals required for efficient polyadenylation of the transcript, ribosome binding sites, and 
translation termination. The nucleic acid sequence encoding the peptide is typically linked to 
a cleavable signal peptide sequence to promote secretion of the peptide by the transformed 

25 cell. Such signal peptides include, among others, the signal peptides from tissue plasminogen 
activator, insulin, and neuron growth factor, and juvenile hormone esterase of Heliothis 
virescens. Additional elements of the cassette may include enhancers and, if genomic DNA 
is used as the structural gene, introns with functional splice donor and acceptor sites. 

[0168] In addition to a promoter sequence, the expression cassette should also contain a 
30 transcription termination region downstream of the structural gene to provide for efficient 
termination. The termination region may be obtained from the same gene as the promoter 
sequence or may be obtained from different genes. 
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[0169] The particular expression vector used to transport the genetic information into the 
cell is not particularly critical. Any of the conventional vectors used for expression in 
eukaryotic or prokaryotic cells may be used. Standard bacterial expression vectors include 
plasmids such as pBR322~based plasmids, pSKF, pET23D, and fusion expression systems 
5 such as GST and LacZ. Epitope tags can also be added to recombinant proteins to provide 
convenient methods of isolation, e.g., c-myc. 

[0170] Expression vectors containing regulatory elements from eukaryotic viruses are 
typically used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus vectors, 
and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic vectors include 
1 0 pMSG, pAV009/A + , pMTO 1 0/A + , pMAMneo-5, baculovirus pDS VE, and any other vector 
allowing expression of proteins under the direction of the SV40 early promoter, SV40 later 
promoter, metallothionein promoter, murine mammary tumor virus promoter, Rous sarcoma 
virus promoter, polyhedrin promoter, or other promoters shown effective for expression in 
eukaryotic cells. 

1 5 [0171] In some exemplary embodiments the expression vector is chosen from pCWinl , 
pCWin2, pCWin2/MBP, pCWin2-MBP-SBD (pMS 39 ), and pCWin2-MBP-MCS-SBD 
(PMXS39) as disclosed in co-owned U.S. Patent application filed April 9, 2004 which is 
incorporated herein by reference. 

[0172] Some expression systems have markers that provide gene amplification such as 
20 thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase. 

Alternatively, high yield expression systems not involving gene amplification are also 
suitable, such as a baculovirus vector in insect cells, with a polynucleotide sequence encoding 
the mutant peptide under the direction of the polyhedrin promoter or other strong baculovirus 
promoters. 

25 [0173] The elements that are typically included in expression vectors also include a 

replicon that functions in E. coli, a gene encoding antibiotic resistance to permit selection of 
bacteria that harbor recombinant plasmids, and unique restriction sites in nonessential regions 
of the plasmid to allow insertion of eukaryotic sequences. The particular antibiotic resistance 
gene chosen is not critical, any of the many resistance genes known in the art are suitable. 

30 The prokaryotic sequences are optionally chosen such that they do not interfere with the 
replication of the DNA in eukaryotic cells, if necessary. 
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[0174] When periplasmic expression of a recombinant protein (e.g. , a hgh mutant of the 
present invention) is desired, the expression vector further comprises a sequence encoding a 
secretion signal, such as the E. coli OppA (Periplasmic Oligopeptide Binding Protein) 
secretion signal or a modified version thereof, which is directly connected to 5' of the coding 
5 sequence of the protein to be expressed. This signal sequence directs the recombinant protein 
produced in cytoplasm through the cell membrane into the periplasmic space. The expression 
vector may further comprise a coding sequence for signal peptidase 1 , which is capable of 
enzymatically cleaving the signal sequence when the recombinant protein is entering the 
periplasmic space. More detailed description for periplasmic production of a recombinant 
10 protein can be found in, e.g., Gray et al 9 Gene 39: 247-254 (1985), U.S. Patent Nos. 
6,160,089 and 6,436,674. 

[0175] As discussed above, a person skilled in the art will recognize that various 
conservative substitutions can be made to any wild-type or mutant peptide or its coding 
sequence while still retaining the biological activity of the peptide. Moreover, modifications 
15 of a polynucleotide coding sequence may also be made to accommodate preferred codon 
usage in a particular expression host without altering the resulting amino acid sequence. 

Transfection Methods 

[0176] Standard transfection methods are used to produce bacterial, mammalian, yeast or 
insect cell lines that express large quantities of the mutant peptide, which are then purified 
20 using standard techniques {see, e.g., Colley et al., J. Biol. Chem. 264: 17619-17622 (1989); 
Guide to Protein Purification, in Methods in Enzymology, vol. 182 (Deutscher, ed., 1990)). 
Transformation of eukaryotic and prokaryotic cells are performed according to standard 
techniques (see, e.g., Mcprrison, J. Bad 132: 349-351 (1977); Clark-Curtiss & Curtiss, 
Methods in Enzymology 101: 347-362 (Wu et ah, eds, 1983). 

25 [0177] Any of the well-known procedures for introducing foreign nucleotide sequences 
into host cells may be used. These include the use of calcium phosphate transfection, 
polybrene, protoplast fusion, electroporation, liposomes, microinjection, plasma vectors, viral 
vectors and any of the other well known methods for introducing cloned genomic DNA, 
cDNA, synthetic DNA, or other foreign genetic material into a host cell (see, e.g., Sambrook 

30 and Russell, supra). It is only necessary that the particular genetic engineering procedure 
used be capable of successfully introducing at least one gene into the host cell capable of 
expressing the mutant peptide. 
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Detection of Expression of Mutant Peptide in Host Cells 

[0178] After the expression vector is introduced into appropriate host cells, the transfected 
cells are cultured under conditions favoring expression of the mutant peptide. The cells are 
then screened for the expression of the recombinant polypeptide, which is subsequently 
5 recovered from the culture using standard techniques (see, e.g., Scopes, Protein Purification: 
Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et aL, supra; and 
Sambrook and Russell, supra). 

[0179] Several general methods for screening gene expression are well known among those 
skilled in the art. First, gene expression can be detected at the nucleic acid level. A variety 

10 of methods of specific DNA and RNA measurement using nucleic acid hybridization 

techniques are commonly used (e.g., Sambrook and Russell, supra). Some methods involve 
an electrophoretic separation (e.g., Southern blot for detecting DNA and Northern blot for 
detecting RNA), but detection of DNA or RNA can be carried out without electrophoresis as 
well (such as by dot blot). The presence of nucleic acid encoding a mutant peptide in 

1 5 transfected cells can also be detected by PGR or RT-PCR using sequence-specific primers. 

[0180] Second, gene expression can be detected at the polypeptide level. Various 
immunological assays are routinely used by those skilled in the art to measure the level of a 
gene product, particularly using polyclonal or monoclonal antibodies that react specifically 
with a mutant peptide of the present invention, such as a polypeptide having the amino acid 

20 sequence of SEQ ID NO: 1-7, (e.g., Harlow and Lane, Antibodies, A Laboratory Manual, 
Chapter 14, Cold Spring Harbor, 1988; Kohler and Milstein, Nature, 256: 495-497 (1975)). 
Such techniques require antibody preparation by selecting antibodies with high specificity 
against the mutant peptide or an antigenic portion thereof. The methods of raising polyclonal 
and monoclonal antibodies are well established and their descriptions can be found in the 

25 literature, see, e.g., Harlow and Lane, supra; Kohler and Milstein, Eur. J. Immunol., 6: 511- 
519 (1976). More detailed descriptions of preparing antibody against the mutant peptide of 
the present invention and conducting immunological assays detecting the mutant peptide are 
provided in a later section. 

Purification of Recombinantly Produced Mutant Peptide 
30 [0181] Once the expression of a recombinant mutant peptide in transfected host cells is 

confirmed, the host cells are then cultured in an appropriate scale for the purpose of purifying 
the recombinant polypeptide. 
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1. Purification of Recombinantly Produced Mutant Peptide from Bacteria 
[0182] When the mutant peptides of the present invention are produced recombinantly by 
transformed bacteria in large amounts, typically after promoter induction, although 
expression can be constitutive, the proteins may form insoluble aggregates. There are several 
5 protocols that are suitable for purification of protein inclusion bodies. For example, 
purification of aggregate proteins (hereinafter referred to as inclusion bodies) typically 
involves the extraction, separation and/or purification of inclusion bodies by disruption of 
bacterial cells, e.g., by incubation in a buffer of about 100-150 |Lig/ml lysozyme and 0.1% 
Nonidet P40, a non-ionic detergent. The cell suspension can be ground using a Polytron 
10 grinder (Brinkman Instruments, Westbury, NY). Alternatively, the cells can be sonicated on 
ice. Alternate methods of lysing bacteria are described in Ausubel et ah and Sambrook and 
Russell, both supra, and will be apparent to those of skill in the art. 

[0183] The cell suspension is generally centrifuged and the pellet containing the inclusion 
bodies resuspended in buffer which does not dissolve but washes the inclusion bodies, e.g. , 
15 20 mM Tris-HCl (pH 7.2), 1 mM EDTA, 150 mM NaCl and 2% Triton-X 100, a non-ionic 
detergent. It may be necessary to repeat the wash step to remove as much cellular debris as 
possible. The remaining pellet of inclusion bodies may be resuspended in an appropriate 
buffer (e.g., 20 mM sodium phosphate, pH 6.8, 150 mM NaCl). Other appropriate buffers 
will be apparent to those of skill in the art. 

20 [0184] Following the washing step, the inclusion bodies are solubilized by the addition of a 
solvent that is both a strong hydrogen acceptor and a strong hydrogen donor (or a 

combination of solvents each having one of these properties). The proteins that formed the 

i 

inclusion bodies may then be renatured by dilution or dialysis with a compatible buffer. 
Suitable solvents include, but are not limited to, urea (from about 4 M to about 8 M), 

25 formamide (at least about 80%, volume/volume basis), and guanidine hydrochloride (from 
about 4 M to about 8 M). Some solvents that are capable of solubilizing aggregate-forming 
proteins, such as SDS (sodium dodecyl sulfate) and 70% formic acid, may be inappropriate 
for use in this procedure due to the possibility of irreversible denaturation of the proteins, 
accompanied by a lack of immunogenicity and/or activity. Although guanidine 

30 hydrochloride and similar agents are denaturants, this denaturation is not irreversible and 

renaturation may occur upon removal (by dialysis, for example) or dilution of the denaturant, 
allowing re-formation of the immunologically and/or biologically active protein of interest. 
After solubilization, the protein can be separated from other bacterial proteins by standard 
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% 

separation techniques. For further description of purifying recombinant peptide from 
bacterial inclusion body, see, e.g., Patra et al, Protein Expression and Purification 18: 182- 
190 (2000). 

[0185] Alternatively, it is possible to purify recombinant polypeptides, e.g., a mutant 
5 peptide, from bacterial periplasm. Where the recombinant protein is exported into the 
periplasm of the bacteria, the periplasmic fraction of the bacteria can be isolated by cold 
osmotic shock in addition to other methods known to those of skill in the art {see e.g., 
Ausubel et al 9 supra). To isolate recombinant proteins from the periplasm, the bacterial cells 
are centrifuged to form a pellet. The pellet is resuspended in a buffer containing 20% 
10 sucrose. To lyse the cells, the bacteria are centrifuged and the pellet is resuspended in ice- 
cold 5 mM MgSC>4 and kept in an ice bath for approximately 1 0 minutes. The cell 
suspension is centrifuged and the supernatant decanted and saved. The recombinant proteins 
present in the supernatant can be separated from the host proteins by standard separation 
techniques well known to those of skill in the art. 

15 2. Standard Protein Separation Techniques for Purification 

[0186] When a recombinant polypeptide, e.g. , the mutant peptide of the present invention, 
is expressed in host cells in a soluble form, its purification can follow the standard protein 
purification procedure described below. 

I Solubility Fractionation 

20 [0187] Often as an initial step, and if the protein mixture is complex, an initial salt 

fractionation can separate many of the unwanted host cell proteins (or proteins derived from 
the cell culture media) from the recombinant protein of interest, e.g., a mutant peptide of the 
present invention. The preferred salt is ammonium sulfate. Ammonium sulfate precipitates 
proteins by effectively reducing the amount of water in the protein mixture. Proteins then 

25 precipitate on the basis of their solubility. The more hydrophobic a protein is, the more likely 
it is to precipitate at lower ammonium sulfate concentrations. A typical protocol is to add 
saturated ammonium sulfate to a protein solution so that the resultant ammonium sulfate 
concentration is between 20-30%. This will precipitate the most hydrophobic proteins. The 
precipitate is discarded (unless the protein of interest is hydrophobic) and ammonium sulfate 

30 is added to the supernatant to a concentration known to precipitate the protein of interest. 

The precipitate is then solubilized in buffer and the excess salt removed if necessary, through 
either dialysis or diafiltration. Other methods that rely on solubility of proteins, such as cold 

49 



WO 2005/070138 PCT/US2005/000799 

ethanol precipitation, are well known to those of skill in the art and can be used to fractionate 
complex protein mixtures. 

it Size Differential Filtration 

[0188] Based on a calculated molecular weight, a protein of greater and lesser size can be 
5 isolated using ultrafiltration through membranes of different pore sizes (for example, Amicon 
or Millipore membranes). As a first step, the protein mixture is ultrafiltered through a 
membrane with a pore size that has a lower molecular weight cut-off than the molecular 
weight of a protein of interest, e.g., a mutant peptide. The retentate of the ultrafiltration is 
then ultrafiltered against a membrane with a molecular cut off greater than the molecular 
1 0 weight of the protein of interest. The recombinant protein will pass through the membrane 
into the filtrate. The filtrate can then be chromatographed as described below. 

ill Column Chromatography 

[0189] The proteins of interest (such as the mutant peptide of the present invention) can 
also be separated from other proteins on the basis of their size, net surface charge, 
1 5 hydrophobicity, or affinity for ligands. In addition, antibodies raised against peptide can be 
conjugated to column matrices and the peptide immunopurified. All of these methods are 
well known in the art. 

[0190] It will be apparent to one of skill that chromatographic techniques can be performed 
at any scale and using equipment from many different manufacturers (e.g., Pharmacia 
20 Biotech). 

Immunoassays for Detection of Mutant Peptide Expression 

[0191] To confirm the production of a recombinant mutant peptide, immunological assays 
may be useful to detect in a sample the expression of the polypeptide. Immunological assays 
are also useful for quantifying the expression level of the recombinant hormone. Antibodies 
25 against a mutant peptide are necessary for carrying out these immunological assays. 

Production of Antibodies against Mutant Peptide 

[0192] Methods for producing polyclonal and monoclonal antibodies that react specifically 
with an immunogen of interest are known to those of skill in the art (see, e.g. } Coligan, 
Current Protocols in Immunology Wiley/Greene, NY, 1991; Harlow and Lane, Antibodies: A 
30 Laboratory Manual Cold Spring Harbor Press, NY, 1989; Stites et al. (eds.) Basic and 

Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, CA, and references 
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cited therein; Goding, Monoclonal Antibodies: Principles and Practice (2d ed.) Academic 
Press, New York, NY, 1986; and Kohler and Milstein Nature 256: 495-497, 1975). Such 
techniques include antibody preparation by selection of antibodies from libraries of 
recombinant antibodies in phage or similar vectors {see, Huse et al., Science 246: 1275-1281, 
5 1989; and Ward et al., Nature 341: 544-546, 1989). 

[0193] In order to produce antisera containing antibodies with desired specificity, the 

polypeptide of interest (e.g., a mutant peptide of the present invention) or an antigenic 

i" 

fragment thereof can be used to immunize suitable animals, e.g., mice, rabbits, or primates. 
A standard adjuvant, such as Freund's adjuvant, can be used in accordance with a standard 
10 immunization protocol. Alternatively, a synthetic antigenic peptide derived from that 

particular polypeptide can be conjugated to a carrier protein and subsequently used as an 
immunogen. 

[0194] The animal's immune response to the immunogen preparation is monitored by 
taking test bleeds and determining the titer of reactivity to the antigen of interest. When 
1 5 appropriately high titers of antibody to the antigen are obtained, blood is collected from the 
animal and antisera are prepared. Further fractionation of the antisera to enrich antibodies 
specifically reactive to the antigen and purification of the antibodies can be performed 
subsequently, see, Harlow and Lane, supra, and the general descriptions of protein 
purification provided above. 

20 [0195] Monoclonal antibodies are obtained using various techniques familiar to those of 
skill in the art. Typically, spleen cells from an animal immunized with a desired antigen are 
immortalized, commonly by fusion with a myeloma cell (see, Kohler and Milstein, Eur. J. 
Immunol. 6:511-519, 1976). Alternative methods of immortalization include, e.g., 
transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other methods well 

25 known in the art. Colonies arising from single immortalized cells are screened for production 
of antibodies of the desired specificity and affinity for the antigen, and the yield of the 
monoclonal antibodies produced by such cells may be enhanced by various techniques, 
including injection into the peritoneal cavity of a vertebrate host. 

[0196] Additionally, monoclonal antibodies may also be recombinantly produced upon 
30 identification of nucleic acid sequences encoding an antibody with desired specificity or a 
binding fragment of such antibody by screening a human B cell cDNA library according to 
the general protocol outlined by Huse et al. , supra. The general principles and methods of 
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recombinant polypeptide production discussed above are applicable for antibody production 
by recombinant methods. 

V 

[0197] When desired, antibodies capable of specifically recognizing a mutant peptide of the 
present invention can be tested for their cross-reactivity against the wild-type peptide and 
5 thus distinguished from the antibodies against the wild-type protein. For instance, antisera 
obtained from an animal immunized with a mutant peptide can be run through a column on 
which a wild-type peptide is immobilized. The portion of the antisera that passes through the 
column recognizes only the mutant peptide and not the wild-type peptide. Similarly, 
monoclonal antibodies against a mutant peptide can also be screened for their exclusivity in 
1 0 recognizing only the mutant but not the wild-type peptide. 

[0198] Polyclonal or monoclonal antibodies that specifically recognize only the mutant 
peptide of the present invention but not the wild-type peptide are useful for isolating the 
mutant protein from the wild-type protein, for example, by incubating a sample with a mutant 
peptide-specific polyclonal or monoclonal antibody immobilized on a solid support. 

1 5 Immunoassays for Detecting Mutant Peptide Expression 

[0l£9] Once antibodies specific for a mutant peptide of the present invention are available, 
the amount of the polypeptide in a sample, e.g., a, cell lysate, can be measured by a variety of 
immunoassay methods providing qualitative and quantitative results to a skilled artisan. For 
a review of immunological and immunoassay procedures in general see, e.g., Stites, supra; 

20 U.S. Patent Nos. 4,366,241; 4,376,110; 4,517,288; and 4,837,168. 

Labeling in Immunoassays 

[0200] Immunoassays often utilize a labeling agent to specifically bind to and label the 
binding complex formed by the antibody and the target protein. The labeling agent may itself 
be one of the moieties comprising the antibody/target protein complex, or may be a third 

25 moiety, such as another antibody, that specifically binds to the antibody/target protein 
complex. A label may be detectable by spectroscopic, photochemical, biochemical, 
immunochemical, electrical, optical or chemical means. Examples include, but are not 
limited to, magnetic beads (e.g., Dynabeads™), fluorescent dyes (e.g., fluorescein 
isothiocyanate, Texas red, rhodamine, and the like), radiolabels (e.g., H, I, S, C, or 

30 32 P), enzymes (e.g. , horse radish peroxidase, alkaline phosphatase, and others commonly used 
in an ELISA), and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., 
polystyrene, polypropylene, latex, etc.) beads. 
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[0201] In some cases, the labeling agent is a second antibody bearing a detectable label. 
Alternatively, the second antibody may lack a label, but it may, in turn, be bound by a labeled 
third antibody specific to antibodies of the species from which the second antibody is 
derived. The second antibody can be modified with a detectable moiety, such as biotin, to 
5 which a third labeled molecule can specifically bind, such as enzyme-labeled streptavidin. 

[0202] Other proteins capable of specifically binding immunoglobulin constant regions, 
such as protein A or protein G, can also be used as the label agents. These proteins are normal 
constituents of the cell walls of streptococcal bacteria. They exhibit a strong non- 
immunogenic reactivity with immunoglobulin constant regions from a variety of species (see, 
10 generally, Kronval, et al. J. Immunol., Ill: 1401-1406 (1973); and Akerstrom, et al., 
J. Immunol, 135: 2589-2542 (1985)). 

Immunoassay Formats 

[0203] Immunoassays for detecting a target protein of interest (e.g. , a mutant human 
growth hormone) from samples may be either competitive or noncompetitive. 

1 5 Noncompetitive immunoassays are assays in which the amount of captured target protein is 
directly measured. In one preferred "sandwich" assay, for example, the antibody specific for 
the target protein can be bound directly to a solid substrate where the antibody is 
immobilized. It then captures the target protein in test samples. The antibody/target protein 
complex thus immobilized is then bound by a labeling agent, such as a second or third 

20 antibody bearing a label, as described above. 

[0204] In competitive assays, the amount of target protein in a sample is measured 
indirectly by measuring the amount of an added (exogenous) target protein displaced (or 
competed away) from an antibody specific for the target protein by the target protein present 
in the sample. In a typical example of such an assay, the antibody is immobilized and the 
25 exogenous target protein is labeled. Since the amount of the exogenous target protein bound 
to the antibody is inversely proportional to the concentration of the target protein present in 
the sample, the target protein level in the sample can thus be determined based on the amount 
of exogenous target protein bound to the antibody and thus immobilized. 

[0205] In some cases, western blot (immunoblot) analysis is used to detect and quantify the 
30 presence of a mutant peptide in the samples. The technique generally comprises separating 
sample proteins by gel electrophoresis on the basis of molecular weight, transferring the 
separated proteins to a suitable solid support (such as a nitrocellulose filter, a nylon filter, or a 
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derivatized nylon filter) and incubating the samples with the antibodies that specifically bind 
the target protein. These antibodies may be directly labeled or alternatively may be 
subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) 
that specifically bind to the antibodies against a mutant peptide. 

5 [0206] Other assay formats include liposome immunoassays (LIA) 5 which use liposomes 
designed to bind specific molecules (e.g., antibodies) and release encapsulated reagents or 
markers. The released chemicals are then detected according to standard techniques (see, 
Monroe etal.,Amer. Clin. Prod. Rev., 5: 34-41 (1986)). 

The Conjugates 

1 0 [0207] In a representative aspect, the present invention provides a glycoconjugate between 
a peptide and a selected modifying group, in which the modifying group is conjugated to the 
peptide through a glycosyl linking group, e.g., an intact glycosyl linking group. The glycosyl 
linking group is directly bound to an O-linked glycosylation site on the peptide or, 
alternatively, it is bound to an O-linked glycosylation site through one or more additional 

15 glycosyl residues. Methods of preparing the conjugates are set forth herein and in U.S. Patent 
No. 5,876,980; 6,030,815; 5,728,554; 5,922,577; WO 98/31826; US2003180835; and WO 
03/031464. 

[0208] Exemplary peptides include an O-linked GalNAc residue that is bound to the O- 
linked glycosylation site through the action of a GalNAc transferase. The GalNAc itself may 
20 be the intact glycosyl linking group. The GalNAc may also be further elaborated by, for 

example, a Gal or Sia residue, either of which can act as the intact glycosyl linking group. In 
representative embodiments, the O-linked saccharyl residue is GalNAc-X , GalNAc-Gal- 
Sia-X , or GalNAc-Gal-Gal-Sia-X, in which X is a modifying group. 

[0209] In an exemplary embodiment, the peptide is a mutant peptide that includes an O- 
25 linked glycosylation site not present in the wild-type peptide. The peptide is preferably O- 
glycosylated at the mutated site with a GalNAc residue. The discussion immediately 
preceding regarding the structure of the saccharyl moiety is relevant here as well. 

[0210] The link between the peptide and the selected moiety includes an intact glycosyl 
linking group interposed between the peptide and the modifying moiety. As discussed 
30 herein, the selected moiety is essentially any species that can be attached to a saccharide unit, 
resulting in a "modified sugar" that is recognized by an appropriate transferase enzyme, 
which appends the modified sugar onto the peptide. The saccharide component of the 
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modified sugar, when interposed between the peptide and a selected moiety, becomes an 
"intact glycosyl linking group." The glycosyl linking group is formed from any mono- or 
oligo-saccharide that, after modification with a selected moiety, is a substrate for an 
appropriate transferase. 

[0211] The conjugates of the invention will typically correspond to the general structure: 




in which the symbols a, b, c, d and s represent a positive, non-zero integer; and t is either 0 or 
a positive integer. The "agent" is a therapeutic agent, a bioactive agent, a detectable lable, 
water-soluble moiety or the like. The "agent" can be a peptide, e.g, enzyme, antibody, 
anitgen, etc. The linker can be any of a wide array of linking groups, infra. Alternatively, 
the linker may be a single bond or a "zero order linker." The identity of the peptide is 
without limitation. 

[0212] In an exemplary embodiment, the selected moiety is a water-soluble polymer, e.g., 
PEG, m-PEG, PPG, m-PPG, etc. The water-soluble polymer is covalently attached to the 
peptide via a glycosyl linking group. The glycosyl linking group is covalently attached to 
either an amino acid residue or a glycosyl residue of the peptide. Alternatively, the glycosyl 
linking group is attached to one or more glycosyl units of a glycopeptide. The invention also 
provides conjugates in which the glycosyl linking group (e.g., GalNAc) is attached to an 
amino acid residue (e.g., Thr or Ser). 

[0213] In an exemplary embodiment, the protein is an interferon. The interferons are 
antiviral glycoproteins that, in humans, are secreted by human primary fibroblasts after 
induction with virus or double-stranded RNA. Interferons are of interest as therapeutics, e.g, 
antiviral agents (e.g., hepatitis B and C), antitumor agents (e.g., hepatocellular carcinoma) 
and in the treatment of multiple sclerosis. For references relevant to interferon-a, see, Asano, 
et al, Eur. J. Cancer, 27(SuppI 4):S21-S25 (1991); Nagy, et al , Anticancer Research, 
8(3):467-470 (1988); Dron, etal 9 J. Biol Regul Homeost. Agents, 3(1):13-19 (1989); Habib, 
etal, Am. Surg., 67(3):257-260 (3/2001); and Sugyiama, etal, Eur. J. Biochem., 217:921- 
927 (1993). For references discussing intefereon-p, see, e.g., Yu, et al, J. Neuroimmunol , 
64(1):91-100 (1996); Schmidt, J., J. Neurosci. Res., 65(l):59-67 (2001); Wender, et al, Folia 
Neuropathol, 39(2):91-93 (2001); Martin, etal, Springer Semin. Immunopathol , 18(l):l-24 
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(1996); Takane, et al. , J. Pharmacol. Exp. Ther., 294(2):746-752 (2000); Sburlati, etal, 
Biotechnol. Prog., 14:189-192 (1998); Dodd, et al. , Biochimica et Biophysica Acta, 787:183- 
187 (1984); Edelbaum, et al.,J. Interferon Res., 12:449-453 (1992); Conradt, et al.,J. Biol. 
Chem., 262(30): 14600-14605 (1987); Civas, etal., Eur. J. Biochem., 173:311-316 (1988); 
Demolder, etal, J. Biotechnol, 32:179-189 (1994);.Sedmak, et al,J. Interferon Res., 
9(Suppl 1):S61-S65 (1989); Kagawa, etal, J. Biol Chem., 263(33):17508-17515 (1988); 
Hershenson, etal, U.S. Patent No. 4,894,330; Jayaram, etal, J. Interferon Res. , 3(2):177- 
180 (1983); Menge, et al, Develop. Biol. Standard., 66:391-401 (1987); Vonk, et al, J. 
Interferon Res., 3(2):169-175 (1983); and Adolf, et al.,J. Interferon Res., 10:255-267 (1990). 

[0214] In an exemplary interferon conjugate, interferon alpha, e.g., interferon alpha 2b and 
2a, is conjugated to a water soluble polymer through an intact glycosyl linker. 

[0215] In a further exemplary embodiment; the invention provides a conjugate of human 
granulocyte colony stimulating factor (G-CSF). G-CSF is a glycoprotein that stimulates 
proliferation, differentiation and activation of neutropoietic progenitor cells into functionally 
mature neutrophils. Injected G-CSF is rapidly cleared from the body. See, for example, 
Nohynek, et al., Cancer Chemother. Pharmacol, 39:259-266 (1997); Lord, et al., Clinical 
Cancer Research, 7(7):2085-2090 (07/2001); Rotondaro, et al., Molecular Biotechnology, 
11(2): 117-128 (1999); and Bonig, et al., Bone Marrow Transplantation, 28: 259-264 (2001). 

[0216] The present invention encompasses a method for the modification of GM-CSF. 
GM-CSF is well known in the art as a cytokine produced by activated T-cells, macrophages, 
endothelial cells, and stromal fibroblasts. GM-CSF primarily acts on the bone marrow to 
increase the production of inflammatory leukocytes, and further functions as an endocrine 
hormone to initiate the replenishment of neutrophils consumed during inflammatory 
functions. Further GM-CSF is a macrophage-activating factor and promotes the 
differentiation of Lagerhans cells into dendritic cells. Like G-CSF, GM-CSF also has clinical 

t 

applications in bone marrow replacement following chemotherapy 

[0217] In addition to providing conjugates that are formed through an enzymatically added 
intact glycosyl linking group, the present invention provides conjugates that are highly 
homogenous in their substitution patterns. Using the methods of the invention, it is possible 
to form peptide conjugates in which essentially all of the modified sugar moieties across a 
population of conjugates of the invention are attached to a structurally identical amino acid or 
glycosyl residue. Thus, in a second aspect, the invention provides a peptide conjugate having 
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a population of water-soluble polymer moieties, which are covalently bound to the peptide 
through an intact glycosyl linking group. In another conjugate of the invention, essentially 
each member of the population is bound via the glycosyl linking group to a glycosyl residue 
of the peptide, and each glycosyl residue of the peptide to which the glycosyl linking group is 
5 attached has the same structure. 

[0218] Also provided is a peptide conjugate having a population of water-soluble polymer 
moieties covalently bound thereto through a glycosyl linking group. In another embodiment, 
essentially every member of the population of water soluble polymer moieties is bound to an 
amino acid residue of the peptide via an intact glycosyl linking group, and each amino acid 
10 residue having an intact glycosyl linking group attached thereto has the same structure. 

[0219] The present invention also provides conjugates analogous to those described above 
in which the peptide is conjugated to a therapeutic moiety, diagnostic moiety, targeting 
moiety, toxin moiety or the like via a glycosyl linking group. Each of the above-recited 
moieties can be a small molecule, natural polymer (e.g., polypeptide) or synthetic polymer. 

15 [0220] In a still further embodiment, the invention provides conjugates that localize 

selectively in a particular tissue due to the presence of a targeting agent as a component of the 
conjugate. In an exemplary embodiment, the targeting agent is a protein. Exemplary 
proteins include transferrin (brain, blood pool), HS-glycoprotein (bone, brain, blood pool), 
antibodies (brain, tissue with antibody-specific antigen, blood pool), coagulation factors V- 

20 XII (damaged tissue, clots, cancer, blood pool), serum proteins, e.g., ot-acid glycoprotein, 
fetuin, a-fetal protein (brain, blood pool), 02-glycoprotein (liver, atherosclerosis plaques, 
brain, blood pool), G-CSF, GM-CSF, M-CSF, and EPO (immune stimulation, cancers, blood 
pool, red blood cell overproduction, neuroprotection), albumin (increase in half-life), IL-2 
and IFN-a. 

25 [0221] In an exemplary targeted conjugate, interferon alpha 20 (IFN-a 20) is conjugated to 
transferrin via a bifunctional linker that includes an intact glycosyl linking group at each 
terminus of the PEG moiety (Scheme 1). For example, one terminus of the PEG linker is 
functionalized with an intact sialic acid linker that is attached to transferrin and the other is 
functionalized with an intact O-linked GalNAc linker that is attached to IFN-a 20. 

* 

30 [0222] The conjugates of the invention can include glycosyl linking groups that are mono- 
or multi-valent (e.g., antennary structures). Thus, conjugates of the invention include both 
species in which a selected moiety is attached to a peptide via a monovalent glycosyl linking 
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group. Also included within the invention are conjugates in which more than one selected 
moiety is attached to a peptide via a multivalent linking group. 



The Methods 

[0223] In addition to the conjugates discussed above, the present invention provides 
5 methods for preparing these and other conjugates. Moreover, the invention provides methods 
of preventing, curing or ameliorating a disease state by administering a conjugate of the 
invention to a subject at risk of developing the disease or a subject that has the disease. 
Additionally, the invention provides methods for targeting conjugates of the invention to a 
particular tissue or region of the body. 

1 0 [0224] Thus, the invention provides a method of forming a covalent conjugate between a 
selected moiety and a peptide. In exemplary embodiments, the conjugate is formed between 
a water-soluble polymer, a therapeutic moiety, targeting moiety or a biomolecule, and a 
glycosylated or non-glycosylated peptide. The polymer, therapeutic moiety or biomolecule is 
conjugated to the peptide via a glycosyl linking group, which is interposed between, and 

15 covalently linked to both the peptide and the modifying group (e.g. water-soluble polymer). 
The method includes contacting the peptide with a mixture containing a modified sugar and a 
glycosyltransferase for which the modified sugar is a substrate. The reaction is conducted 
under conditions appropriate to form a covalent bond between the modified sugar and the 
peptide. The sugar moiety of the modified sugar is preferably selected from nucleotide 

20 sugars, activated sugars and sugars, which are neither nucleotides nor activated. 

[0225] The acceptor peptide (O-glycosylated or non-glycosylated) is typically synthesized 
de novo, or recombinantly expressed in a prokaryotic cell (e.g., bacterial cell, such as E. colt) 
or in a eukaryotic cell such as a mammalian, yeast, insect, fungal or plant cell. The peptide 
can be either a full-length protein or a fragment. Moreover, the peptide can be a wild type or 
25 mutated peptide. In an exemplary embodiment, the peptide includes a mutation that adds one 
or more N- or O-linked glycosylation sites to the peptide sequence. 

[0226] In an exemplary embodiment, the peptide is O-glycosylated and functionalized with a 
water-soluble polymer in the following manner. The peptide is either produced with an 
available amino acid glycosylation site or, if glycosylated, the glycosyl moiety is trimmed off 
30 to exposed the amino acid. For example, GalNAc is added to a serine or threonine and the 

galactosylated peptide is sialylated with a sialic acid-modifying group cassette using ST6Gal- 
1. Alternatively, the galactosylated peptide is galactosylated using Core- 1 -GalT- 1 and the 
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product is sialylated with a sialic acid-modifying group cassette using ST3GalTl. An 
exemplary conjugate according to this method has the following linkages: Thr-oc- 
l-GalNAc-|3-l,3-Gal-a2,3~Sia*, in which Sia* is the sialic acid-modifying group cassette. 
[0227] In the methods of the invention, such as that set forth above, using multiple enzymes 
5 and saccharyl donors, the individual glycosylation steps may be performed separately, or 
combined in a "single pot" reaction. For example, in the three enzyme reaction set forth 
above the GalNAc tranferase, GalT and SiaT and their donors may be combined in a single 
vessel. Alternatively, the GalNAc reaction can be performed alone and both the GalT and 
SiaT and the appropriate saccharyl donors added as a single step. Another mode of running 
1 0 the reactions involves adding each enzyme and an appropriate donor sequentially and 

conducting the reaction in a "single pot" motif. Combinations of each of the methods set 
forth above are of use in preparing the compounds of the invention. 

[0228] In the conjugates of the invention, the Sia-modifying group cassette can be linked to 
the Gal in an a-2,6, or a-2,3 linkage. 

15 [0229] For example, in one embodiment, G-CSF is expressed in a mammalian system and 
modified by treatment of sialidase to trim back terminal sialic acid residues, followed by 
PEGylation using ST3Gal3 and a donor of PEG-sialic acid. 

[0230] The method of the invention also provides for modification of incompletely 
glycosylated peptides that are produced recombinantly. Many recombinantly produced 

20 glycoproteins are incompletely glycosylated, exposing carbohydrate residues that may have 
undesirable properties, e.g., immunogenicity, recognition by the RES. Employing a modified 
sugar in a method of the invention, the peptide can be simultaneously further glycosylated 
and derivatized with, e.g., a water-soluble polymer, therapeutic agent, or the like. The sugar 
moiety of the modified sugar can be the residue that would properly be conjugated to the 

25 acceptor in a fully glycosylated peptide, or another sugar moiety with desirable properties. 

[0231] Peptides modified by the methods of the invention can be synthetic or wild-type 
peptides or they can be mutated peptides, produced by methods known in the art, such as site- 
directed mutagenesis. Glycosylation of peptides is typically either N-linked or O-linked. An 
exemplary N-linkage is the attachment of the modified sugar to the side chain of an 
30 asparagine residue. The tripeptide sequences asparagine-X-serine and asparagine-X- 
threonine, where X is any amino acid except proline, are the recognition sequences for 
enzymatic attachment of a carbohydrate moiety to the asparagine side chain. Thus, the 
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presence of either of these tripeptide sequences in a polypeptide creates a potential 
glycosylation site. O-linked glycosylation refers to the attachment of one sugar (e.g., N- 
acetylgalactosamine, galactose, mannose, GlcNAc, glucose, fucose or xylose) to the hydroxy 
side chain of a hydroxyamino acid, preferably serine or threonine, although unusual or non- 
5 natural amino acids, e.g., 5-hydroxyproline or 5-hydroxylysine may also be used. 

[0232] Moreover, in addition to peptides, the methods of the present invention can be 
practiced with other biological structures (e.g., glycolipids, lipids, sphingoids, ceramides, 
whole cells, and the like, containing an O-linked glycosylation site). 

[0233] Addition of glycosylation sites to a peptide or other structure is conveniently 
1 0 accomplished by altering the amino acid sequence such that it contains one or more 

glycosylation sites. The addition may also be made by the incorporation of one or more 
species presenting an -OH group, preferably serine or threonine residues, within the sequence 
of the peptide (for O-linked glycosylation sites). The addition may be made by mutation or 
by full chemical synthesis of the peptide. The peptide amino acid sequence is preferably 
1 5 altered through changes at the DNA level, particularly by mutating the DNA encoding the 
peptide at preselected bases such that codons are generated that will translate into the desired 
amino acids. The DNA mutation(s) are preferably made using methods known in the art. 

[0234] In an exemplary embodiment, the glycosylation site is added by shuffling 
polynucleotides. Polynucleotides encoding a candidate peptide can be modulated with DNA 
20 shuffling protocols. DNA shuffling is a process of recursive recombination and mutation, 

performed by random fragmentation of a pool of related genes, followed by reassembly of the 
fragments by a polymerase chain reaction-like process. See, e.g., Stemmer, Proc. Natl Acad. 
Sci. USA 91:10747-10751 (1994); Stemmer, Nature 370:389-391 (1994); and U.S. Patent 
Nos. 5,605,793, 5,837,458, 5,830,721 and 5,811,238. 

25 [0235] The present invention also provides means of adding (or removing) one or more 

selected glycosyl residues to a peptide, after which a modified sugar is conjugated to at least 
one of the selected glycosyl residues of the peptide. The present embodiment is useful, for 
example, when it is desired to conjugate the modified sugar to a selected glycosyl residue that 
is either not present on a peptide or is not present in a desired amount. Thus, prior to 

30 coupling a modified sugar to a peptide, the selected glycosyl residue is conjugated to the 
peptide by enzymatic or chemical coupling. In another embodiment, the glycosylation 
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pattern of a glycopeptide is altered prior to the conjugation of the modified sugar by the 
removal of a carbohydrate residue from the glycopeptide. See, for example WO 98/3 1 826. 

[0236] Addition or removal of any carbohydrate moieties present on the glycopeptide is 
accomplished either chemically or enzymatically. Chemical deglycosylation is preferably 
5 brought about by exposure of the polypeptide variant to the compound 

trifluoromethanesulfonic acid, or an equivalent compound. This treatment results in the 
cleavage of most or all sugars except the linking sugar (N-acetylglucosamine or N- 
acetylgalactosamine), while leaving the peptide intact. Chemical deglycosylation is 
described by Hakimuddin et ah, Arch. Biochem. Biophys. 259: 52 (1987) and by Edge et al. s 
10 Anal. Biochem. 118: 131 (1981). Enzymatic cleavage of carbohydrate moieties on 

polypeptide variants can be achieved by the use of a variety of endo- and exo-glycosidases as 
described by Thotakura et ah, Meth. Enzymol 138: 350 (1987). 

[0237] Chemical addition of glycosyl moieties is carried out by any art-recognized method. 
Enzymatic addition of sugar moieties is preferably achieved using a modification of the 
15 methods set forth herein, substituting native glycosyl units for the modified sugars used in the 
invention. Other methods of adding sugar moieties are disclosed in U.S. Patent No. 
5,876,980, 6,030,815, 5,728,554, and 5,922,577. 

[0238] Exemplary attachment points for selected glycosyl residue include, but are not 
limited to: (a) consensus sites for N-linked glycosylation, and sites for O-linked 

20 glycosylation; (b) terminal glycosyl moieties that are acceptors for a glycosyltransferase; (c) 
arginine, asparagine and histidine; (d) free carboxyl groups; (e) free sulfhydryl groups such as 
those of cysteine; (f) free hydroxyl groups such as those of serine, threonine, or 
hydroxyproline; (g) aromatic residues such as those of phenylalanine, tyrosine, or tryptophan; 
or (h) the amide group of glutamine. Exemplary methods of use in the present invention are 

25 described in WO 87/05330 published Sep. 1 1, 1987, and in Aplin and Wriston, CRC Crit. 
Rev. Biochem., pp. 259-306 (1981). 

[0239] In one embodiment, the invention provides a method for linking two or more 
peptides through a linking group. The linking group is of any useful structure and may be 
selected from straight- and branched-chain structures. Preferably, each terminus of the 
30 linker, which is attached to a peptide, includes a modified sugar (i.e., a nascent intact glycosyl 
linking group). 
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[0240] In an exemplary method of the invention, two peptides are linked together via a 
linker moiety that includes a PEG linker. The construct conforms to the general structure set 
forth in the cartoon above. As described herein, the construct of the invention includes two 
intact glycosyl linking groups (i.e., s + 1 = 1). The focus on a PEG linker that includes two 
glycosyl groups is for purposes of clarity and should not be interpreted as limiting the identity 
of linker arms of use in this embodiment of the invention. 

[0241] Thus, a PEG moiety is functionalized at a first terminus with a first glycosyl unit 
and at a second terminus with a second glycosyl unit. The first and second glycosyl units are 
preferably substrates for different transferases, allowing orthogonal attachment of the first 
and second peptides to the first and second glycosylunits, respectively. In practice, the 
(glycosyiy-PEG-Cglycosyl) 2 linker is contacted with the first peptide and a first transferase 
for which the first glycosyl unit is a substrate, thereby forming 

(peptide) 1 -(glycosyl) 1 -PEG-(glycosyl) 2 . Transferase and/or unreacted peptide is then 
optionally removed from the reaction mixture. The second peptide and a second transferase 
for which the second glycosyl unit is a substrate are added to the 
(peptide) 1 -(glycosyl) 1 -PEG-(glycosyl) 2 conjugate, forming 

(peptide) 1 -(glycosyl) 1 -PEG-(glycosyl) 2 -(peptide) 2 ; at least one of the glycosyl residues is 
either directly or indirectly O-linked. Those of skill in the art will appreciate that the method 
outlined above is also applicable to forming conjugates between more than two peptides by, 
for example, the use of a branched PEG, dendrimer, poly(amino acid), polsaccharide or the 
like 

[0242] In an exemplary embodiment, interferon alpha 2(3 (IFN-a 20) is conjugated to 
transferrin via a Afunctional linker that includes an intact glycosyl linking group at each 
terminus of the PEG moiety (Scheme 1). The IFN conjugate has an in vivo half-life that is 
increased over that of IFN alone by virtue of the greater molecular sized of the conjugate. 
Moreover, the conjugation of IFN to transferrin serves to selectively target the conjugate to 
the brain. For example, one terminus of the PEG linker is functionalized with a CMP sialic 
acid and the other is functionalized with an UDP GalNAc. The linker is combined with IFN 
in the presence of a GalNAc transferase, resulting in the attachment of the GalNAc of the 
linker arm to a serine and/or threonine residue on the IFN. 

Scheme 1 
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[0243] The processes described above can be carried through as many cycles as desired, 
and is not limited to forming a conjugate between two peptides with a single linker. 
Moreover, those of skill in the art will appreciate that the reactions functionalizing the intact 
glycosyl linking groups at the termini of the PEG (or other) linker with the peptide can occur 
simultaneously in the same reaction vessel, or they can be carried out in a step-wise fashion. 
When the reactions are carried out in a step-wise manner, the conjugate produced at each step 
is optionally purified from one or more reaction components (e.g., enzymes, peptides). 

[0244] A still further exemplary embodiment is set forth in Scheme 2. Scheme 2 shows a 
method of preparing a conjugate that targets a selected protein, e.g., GM-CSF, to bone and 
increases the circulatory half-life of the selected protein. 



Scheme 2 
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in which G is a glycosyl residue on an activated sugar moiety (e.g., sugar nucleotide), which 
is converted to an intact glycosyl linker group in the conjugate. When s is greater than 0 5 L is 
a saccharyl linking group such as GalNAc, or GalNAc-Gal. 

[0245] The use of reactive derivatives of PEG (or other linkers) to attach one or more 
5 peptide moieties to the linker is within the scope of the present invention. The invention is 
not limited by the identity of the reactive PEG analogue. Many activated derivatives of 
poly(ethyleneglycol) are available commercially and in the literature. It is well within the 
abilities of one of skill to choose, and synthesize if necessary, an appropriate activated PEG 
derivative with which to prepare a substrate useful in the present invention. See, Abuchowski 
10 et al Cancer Biochem. Biophys., 7: 175-186 (1984); Abuchowski et al, J. Biol Chem., 252: 
3582-3586 (1977); Jackson et al, Anal Biochem., 165: 1 14-127 (1987); Koide et al, 
Biochem Biophys. Res. Commun., Ill: 659-667 (1983)), tresylate (Nilsson et al, Methods 
EnzymoL, 104: 56-69 (1984); Delgado etal, Biotechnol Appl Biochem., 12: 119-128 

(1990) ); N-hydroxysuccinimide derived active esters (Buckmann et al, Makromol Chem., 
15 182: 1379-1384 (1981); Joppich £tf a/., Makromol Chem., 180: 1381-1384 (1979); 

Abuchowski etal, Cancer Biochem. Biophys., 7: 175-186 (1984); Katrcetal Proc. Natl 
Acad Scl U.S.A., 84: 1487-1491 (1987); Kitamura et al, Cancer Res., 51: 4310-4315 

(1991) ; Boccu et al , Z Naturforsch., 38C: 94-99 (1983), carbonates (Zalipsky et al, 

POLY(ETHYLENE GLYCOL) CHEMISTRY: BlOTECHNICAL AND BIOMEDICAL APPLICATIONS, 

20 Harris, Ed., Plenum Press, New York, 1992, pp. 347-370; Zalipsky et al, Biotechnol Appl 
Biochem., 15: 100-114 (1992); Veronese etal, Appl Biochem. Biotech., 11: 141-152 
(1985)), imidazolyl formates (Beauchamp etal, Anal Biochem., 131: 25-33 (1983); Berger 
et al, Blood, 71: 1641-1647 (1988)), 4-dithiopyridines (Woghiren et al, Bioconjugate 
Chem., 4: 314-318 (1993)), isocyanates (Byun et al,ASAIO Journal, M649-M-653 (1992)) 

25 and epoxides (U.S. Pat. No. 4,806,595, issued to Noishiki et al, (1989). Other linking groups 
include the urethane linkage between amino groups and activated PEG. See, Veronese, et al, 
Appl Biochem. Biotechnol, 11: 141-152 (1985). 

[0246] In another exemplary embodiment in which a reactive PEG derivative is utilized, 
the invention provides a method for extending the blood-circulation half-life of a selected 
30 peptide, in essence targeting the peptide to the blood pool, by conjugating the peptide to a 
synthetic or natural polymer of a size sufficient to retard the filtration of the protein by the 
glomerulus (e.g., albumin). See, Scheme 3. This embodiment of the invention is illustrated 
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in Scheme in which G-CSF is conjugated to albumin via a PEG linker using a combination of 
chemical and enzymatic modification. 



Scheme 3 
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[0247] Thus, as shown in Scheme 3, a residue (e.g., amino acid side chain) of albumin is 

i 

modified with a reactive PEG derivative, such as X-PEG-(CMP-sialic acid), in which X is an 
activating group (e.g, active ester, isothiocyanate, etc). The PEG derivative and G-CSF are 
combined and contacted with a transferase for which CMP-sialic acid is a substrate. In a 
further illustrative embodiment, an s-amine of lysine is reacted with the N- 
hydroxysuccinimide ester of the PEG-linker to form the albumin conjugate. The CMP-sialic 
acid of the linker is enzymatically conjugated to an appropriate residue on GCSF, e.g, Gal, or 
GalNAc thereby forming the conjugate. Those of skill will appreciate that the above- 
described method is not limited to the reaction partners set forth. Moreover, the method can 
be practiced to form conjugates that include more than two protein moieties by, for example, 
utilizing a branched linker having more than two termini. 

Modified Sugars 

[0248] Modified glycosyl donor species ("modified sugars") are preferably selected from 
modified sugar nucleotides, activated modified sugars and modified sugars that are simple 
saccharides that are neither nucleotides nor activated. Any desired carbohydrate structure can 
be added to a peptide using the methods of the invention. Typically, the structure will be a 
monosaccharide, but the present invention is not limited to the use of modified 
monosaccharide sugars; oligosaccharides and polysaccharides are useful as well. 

[0249] The modifying group is attached to a sugar moiety by enzymatic means, chemical 
means or a combination thereof, thereby producing a modified sugar. The sugars are 



65 



WO 2005/070138 PCT/US2005/000799 

substituted at any position that allows for the attachment of the modifying moiety, yet which 
still allows the sugar to function as a substrate for the enzyme used to ligate the modified 
sugar to the peptide. In another embodiment, when sialic acid is the sugar, the sialic acid is 
substituted with the modifying group at either the 9-position on the pyruvyl side chain or at 
5 the 5-position on the amine moiety that is normally acetylated in sialic acid. 

[0250] In certain embodiments of the present invention, a modified sugar nucleotide is 
utilized to add the modified sugar to the peptide. Exemplary sugar nucleotides that are used 
in the present invention in their modified form include nucleotide mono-, di~ or triphosphates 
or analogs thereof. In another embodiment, the modified sugar nucleotide is selected from a 
10 UDP-glycoside, CMP-glycoside, or a GDP-glycoside. Even more preferably, the modified 
sugar nucleotide is selected from an UDP-galactose, UDP-galactosamine, UDP-glucose, 

4 

UDP-glucosamine, GDP-mannose, GDP-fucose, CMP-sialic acid, or CMP-NeuAc. N- 
acetylamine derivatives of the sugar nucletides are also of use in the method of the invention. 

[0251] The invention also provides methods for synthesizing a modified peptide using a 
15 modified sugar, e.g., modified-galactose, -fucose, -GalNAc and -sialic acid. When a 

modified sialic acid is used, either a sialyltransferase or a trans-sialidase (for oc2,3 -linked 
sialic acid only) can be used in these methods. 

[0252] In other embodiments, the modified sugar is an activated sugar. Activated modified 
sugars, which are useful in the present invention are typically glycosides which have been 

20 synthetically altered to include an activated leaving group. As used herein, the term 

"activated leaving group" refers to those moieties, which are easily displaced in enzyme- 
regulated nucleophilic substitution reactions. Many activated sugars are known in the art. 
See, for example, Vocadlo et al., In Carbohydrate Chemistry and Biology, Vol. 2, Ernst 
et al Ed., Wiley- VCH Verlag: Weinheim, Germany, 2000; Kodama et al, Tetrahedron Lett. 

25 34: 6419 (1993); Lougheed, et al, J. Biol Chem. 274: 37717 (1999)). 

[0253] Examples of activating groups (leaving groups) include fluoro, chloro, bromo, 
tosylate ester, mesylate ester, triflate ester and the like. Preferred activated leaving groups, 
for use in the present invention, are those that do not significantly sterically encumber the 
enzymatic transfer of the glycoside to the acceptor. Accordingly, preferred embodiments of 
30 activated glycoside derivatives include glycosyl fluorides and glycosyl mesylates, with 

glycosyl fluorides being particularly preferred. Among the glycosyl fluorides, ot-galactosyl 
fluoride, a-mannosyl fluoride, a-glucosyl fluoride, a-fucosyl fluoride, a-xylosyl fluoride, a- 
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sialyl fluoride, a-N-acetylglucosaminyl fluoride, a-N-acetylgalactosaminyl fluoride, p- 
galactosyl fluoride, P-mannosyl fluoride, P-glucosyl fluoride, p-fucosyl fluoride, P-xylosyl 
fluoride, p-sialyl fluoride, p-N-acetylglucosaminyl fluoride and P-N-acetylgalactosaminyl 
fluoride are most preferred. 

5 [0254] By way of illustration, glycosyl fluorides can be prepared from the free sugar by 
first acetylating the sugar and then treating it with HF/pyridine. This generates the 
thermodynamically most stable anomer of the protected (acetylated) glycosyl fluoride (i.e., 
the a-glycosyl fluoride). If the less stable anomer (i.e., the p-glycosyl fluoride) is desired, it 
can be prepared by converting the peracetylated sugar with HBr/HOAc or with HCI to 
1 0 generate the anomeric bromide or chloride. This intermediate is reacted with a fluoride salt 
such as silver fluoride to generate the glycosyl fluoride. Acetylated glycosyl fluorides may 
be deprotected by reaction with mild (catalytic) base in methanol (e.g. NaOMe/MeOH). In 
addition, many glycosyl fluorides are commercially available. 

[0255] Other activated glycosyl derivatives can be prepared using conventional methods 
15 known to those of skill in the art. For example, glycosyl mesylates can be prepared by 

treatment of the fully benzylated hemiacetal form of the sugar with mesyl chloride, followed 
by catalytic hydro genation to remove the benzyl groups. 

[0256] In a further exemplary embodiment, the modified sugar is an oligosaccharide having 
an antennary structure. In another embodiment, one or more of the termini of the antennae 

20 bear the modifying moiety. When more than one modifying moiety is attached to an 

oligosaccharide having an antennary structure, the oligosaccharide is useful to "amplify" the 
modifying moiety; each oligosaccharide unit conjugated to the peptide attaches multiple 
copies of the modifying group to the peptide. The general structure of a typical conjugate of 
the invention as set forth in the drawing above, encompasses multivalent species resulting 

25 from preparing a conjugate of the invention utilizing an antennary structure. Many antennary 
saccharide structures are known in the art, and the present method can be practiced with them 
without limitation. 

[0257] Exemplary modifying groups are discussed below. The modifying groups can be 
selected for their ability to impart to a peptide one or more desirable property. Exemplary 
30 properties include, but are not limited to, enhanced pharmacokinetics, enhanced 

pharmacodynamics, improved biodistribution, providing a polyvalent species, improved 
water solubility, enhanced or diminished lipophilicity, and tissue targeting. 
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Water-Soluble Polymers 

[0258] Many water-soluble polymers are known to those of skill in the art and are useful in 
practicing the present invention. The term water-soluble polymer encompasses species such 
as saccharides {e.g., dextran, amylose, hyalouronic acid 5 poly(sialic acid), heparans, heparins, 
5 etc.); poly (amino acids), e.g., poly(aspartic acid) and poly(glutamic acid); nucleic acids; 
synthetic polymers (e.g., poly(acrylic acid), poly(ethers), e.g., poly(ethylene glycol); 
peptides, proteins, and the like. The present invention may be practiced with any water- 
soluble polymer with the sole limitation that the polymer must include a point at which the 
remainder of the conjugate can be attached. 

10 [0259] Methods for activation of polymers can also be found in WO 94/17039, U.S. Pat. No. 
5,324,844, WO 94/18247, WO 94/04193, U.S. Pat. No. 5,219,564, U.S. Pat. No. 5,122,614, 
WO 90/13540, U.S. Pat. No. 5,281,698, and more WO 93/151 89, and for conjugation 
between activated polymers and peptides, e.g. Coagulation Factor VIII (WO 94/15625), 
hemoglobin (WO 94/09027), oxygen carrying molecule (U.S. Pat. No. 4,412,989), 

15 ribonuclease and superoxide dismutase (Veronese at al, App. Biochem. Biotech. 11: 141-45 
(1985)). 

[0260] Preferred water-soluble polymers are those in which a substantial proportion of the 
polymer molecules in a sample of the polymer are of approximately the same molecular 
weight; such polymers are "homodisperse." 

20 [0261] The present invention is further illustrated by reference to a poly(ethylene glycol) 

conjugate. Several reviews and monographs on the functionalization and conjugation of PEG 
are available. See, for example, Harris, Macronol. Chem. Phys. C25: 325-373 (1985); 
Scouten, Methods in Enzymology 135: 30-65 (1987); Wong et al, Enzyme Microb. Technol. 
14: 866-874 (1992); Delgado et al, Critical Reviews in Therapeutic Drug Carrier Systems 9: 

25 249-304 (1992); Zalipsky 5 Bioconjugate Chem. 6: 150-165 (1995); and Bhadra, et al, 
Pharmazie, 57:5-29 (2002). Routes for preparing reactive PEG molecules and forming 
conjugates using the reactive molecules are known in the art. For example, U.S. Patent No. 
5,672,662 discloses a water soluble and isolatable conjugate of an active ester of a polymer 
acid selected from linear or branched poly(alkylene oxides), poly(oxyethylated polyols), 

30 poly(olefinic alcohols), and poly(acrylomorpholine). 

[0262] U.S. Patent No. 6,376,604 sets forth a method for preparing a water-soluble 
1-benzotriazolylcarbonate ester of a water-soluble and non-peptidic polymer by reacting a 
terminal hydroxyl of the polymer with di(l-benzotriazoyl)carbonate in an organic solvent. 
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The active ester is used to form conjugates with a biologically active agent such as a protein 
or peptide. 

[0263] WO 99/45964 describes a conjugate comprising a biologically active agent and an 
activated water soluble polymer comprising a polymer backbone having at least one terminus 
5 linked to the polymer backbone through a stable linkage, wherein at least one terminus 
comprises a branching moiety having proximal reactive groups linked to the branching 
moiety, in which the biologically active agent is linked to at least one of the proximal reactive 
groups. Other branched poly(ethylene glycols) are described in WO 96/21469, U.S. Patent 
No. 5,932,462 describes a conjugate formed with a branched PEG molecule that includes a 

10 branched terminus that includes reactive functional groups. The free reactive groups are 
available to react with a biologically active species, such as a protein or peptide, forming 
conjugates between the poly(ethylene glycol) and the biologically active species. U.S. Patent 
No. 5,446,090 describes a Afunctional PEG linker and its use in forming conjugates having a 
peptide at each of the PEG linker termini. 

15 [0264] Conjugates that include degradable PEG linkages are described in WO 99/34833; and 
WO 99/14259, as well as in U.S. Patent No. 6,348,558. Such degradable linkages are 
applicable in the present invention. 

[0265] The art-recognized methods of polymer activation set forth above are of use in the 
context of the present invention in the formation of the branched polymers set forth herein 
20 and also for the conjugation of these branched polymers to other species, e.g., sugars, sugar 
nucleotides and the like. 

[0266] Exemplary poly(ethylene glycol) molecules of use in the invention include, but are 

not limited to, those having the formula: 

Y 



(CH 2 ) b — X(CH 2 CH 2 0)e(CH 2 )d— A 1 — R 8 

25 in which R 8 is H, OH, NH 2 , substituted or unsubstituted alkyl, substituted or unsubstituted 
aryl, substituted or unsubstituted heteroaryl, substituted or unsubstituted heterocycloalkyl, 
substituted or unsubstituted heteroalkyl, e.g., acetal, OHC-, H 2 N-(CH 2 ) q - ? HS~(CH 2 ) q , or 
-(CH2) q C(Y)Z 1 . The index "e" represents an integer from 1 to 2500. The indices b, d, andq 
independently represent integers from 0 to 20. The symbols Z and Z 1 independently 

30 represent OH, NH 2 , leaving groups, e.g., imidazole, p-nitrophenyl, HOBT, tetrazole, halide, 
S-R 9 , the alcohol portion of activated esters; -(CH 2 ) P C(Y 1 )V, or -(CH 2 )pU(CH 2 )sC(Y 1 ) v . The 
symbol Y represents H(2), =0, =S, =N-R 10 . The symbols X, Y, Y 1 , A 1 , and U independently 
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represent the moieties O, S, N-R 11 . The symbol V represents OH, NH 2 , halogen, S-R 12 , the 
alcohol component of activated esters, the amine component of activated amides, sugar- 
nucleotides, and proteins. The indices p, q, s and v are members independently selected 
from the integers from 0 to 20. The symbols R 9 , R 10 , R 11 and R 12 independently represent H, 
substituted or unsubstituted alkyl, substituted or unsubstituted heteroalkyl, substituted or 
unsubstituted aryl, substituted or unsubstituted heterocycloalkyl and substituted or 
unsubstituted heteroaryl. 

[0267] In other exemplary embodiments, the poly(ethylene glycol) molecule is selected from 
the following: 



Me-(OCH 2 CH 2 ) e -0 




O 



Me-(OCH 2 CH 2 ) e — O 




Me-(OCH 2 CH 2 ) e -0 




Me-(OCH 2 CH 2 ) e — (X .Z 

n 

O 

H 

Me-(OCH 2 CH 2 ) e -N^^ 0 ^^ 

O O 

H O 
Me-(OCH 2 CH 2 ) e N 

n t z 
o 



Me-(OCH 2 CH 2 ) e — S— Z 

H 

Me-(OCH 2 CH 2 ) e — N— Z 



Me-(OCH 2 CH 2 ) e HN 

O 




[0268] The poly(ethylene glycol) useful in forming the conjugate of the invention is either 
linear or branched. Branched poly(ethylene glycol) molecules suitable for use in the 
invention include, but are not limited to, those described by the following formula: 



R 8 -AV>(OCH 2 CH 2 ) e -X 1 



N 



(CH 2 ) 




R 8 '-A2^(OCH 2 CH 2 )f -X 1 



in which R and R are members independently selected from the groups defined for R 8 , 
above. A and A are members independently selected from the groups defined for A , 
above. The indices e, f, o, and q are as described above. Z and Y are as described above. 
X 1 and X 1 ' are members independently selected from S, SC(0)NH, HNC(0)S, SC(0)0, O, 
NH, NHC(O), (O)CNH and NHC(0)0, OC(0)NH. 
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[0269] In other exemplary embodiments, the branched PEG is based upon a cysteine, serine 
or di-lysine core. Thus, further exemplary branched PEGs include: 





NHC(0)OCH 2 CH 2 (OCH 2 CH 2 ) e OCH 3 



NHC(0)OCH 2 CH 2 (OCH 2 CH 2 ),OCH 3 



NHC(0)CH 2 CH 2 (OCH 2 CH2) e OCH 3 



S (CH 2 CH 2 0) e CH 3 

NHC(0)CH 2 CH 2 (OCH 2 CH 2 )fOCH 3 



HC(0)CH 2 CH 2 (OCH 2 CH 2 ) 9 OCH 3 




HO' (CH 2 CH 2 O) 0 CH 3 

NHC(0)OCH 2 CH2(OCH 2 CH2)fOCH 3 




0 (CH 2 CH 2 0) e CH 3 

NHC(0)CH 2 CH 2 (OCH 2 CH 2 ) f OCH3 




O (CH 2 CH 2 0) e CH 3 



NHC(0)OCH 2 CH2(OCH2CH 2 )fOCH3 




O (CH 2 CH 2 0) e CH 3 

NHC(0)CH 2 CH 2 OCH 3 



HO 




S (CH 2 CH 2 0) e CH 3 

NHC(0)OCH 3 ' 

; and 




S (CH 2 CH 2 0) e CH 3 



NHC(0)CH 3 



[0270] In yet another embodiment, the branched PEG moiety is based upon a tri-lysine 
5 peptide. The tri-lysine can be mono-, di-, tri-, or tetra-PEG-ylated. Exemplary species 
according to this embodiment have the formulae: 
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NHC(0)OCH 2 CH 2 (OCH 2 CH 2 ) e OCH3 




NHC(0)OCH 2 CH 2 (OCH 2 CH 2 ) f OCH 3 



HC(0)OCH 2 CH 2 (OCH 2 CH 2 ) r OCH 3 



and 



10 



15 



NHC(0)CH 2 CH 2 (OCH 2 CH 2 ) e OCH 3 




NHC(0)CH 2 CH 2 (OCH 2 CH 2 ) f OCH 3 



HC(0)CH 2 CH 2 (OCH 2 CH 2 ) f OCH 3 



in which e ? f and f are independently selected integers from 1 to 2500; and q, q' and q" are 
independently selected integers from 1 to 20. 

[0271] In exemplary embodiments of the invention, the PEG is m-PEG (5 kD 5 10 kD, or 
20kD). An exemplary branched PEG species is a serine- or cysteine-(m-PEG)2 in which the 
m-PEG is a 20 kD m-PEG. 

[0272] As will be apparent to those of skill, the branched polymers of use in the invention 
include variations on the themes set forth above. For example the di-lysine-PEG conjugate 
shown above can include three polymeric subunits, the third bonded to the a-amine shown as 
unmodified in the structure above. Similarly, the use of a tri-lysine functionalized with three 
or four polymeric subunits is within the scope of the invention. 
[0273] Specific embodiments according to the invention include: 



Me' 




OH 



Me' 



H 2 N 




OH 



and 



Me 



e 




OH 
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and carbonates and active esters of these species, such as: 



Me' 




:^ 0 )^cAo ° F 



Me' 




° F 



[0274] Other activating, or leaving groups, appropriate for activating linear PEGs of use in 
preparing the compounds set forth herein include, but are not limited to the species: 





N — O 1 






\ / O HN — NH JJ 



F F 




[0275] PEG molecules that are activated with these and other species and methods of making 
the activated PEGs are set forth in WO 04/083259. 

[0276] Those of skill in the art will appreciate that one or more of the m-PEG arms of the 
branched polymer can be replaced by a PEG moiety with a different terminus, e.g., OH, 
COOH, NH 2 , C 2 -Cio-alkyl, etc. Moreover, the structures above are readily modified by 
inserting alkyl linkers (or removing carbon atoms) between the a-carbon atom and the 
functional group of the side chain. Thus, "homo" derivatives and higher homologues, as well 
as lower homologues are within the scope of cores for branched PEGs of use in the present 
invention. 
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[0277] The branched PEG species set forth herein are readily prepared by methods such as 
that set forth in the scheme below: 




in which X a is O or S and r is an integer from 1 to 5. The indices e and f are independently 
5 selected integers from 1 to 2500. 

[0278] Thus, according to this scheme, a natural or unnatural amino acid is contacted with an 
activated m-PEG derivative, in this case the tosylate, forming 1 by alkylating the side-chain 
heteroatom X a . The mono-functionalized m-PEG amino acid is submitted to N-acylation 
conditions with a reactive m-PEG derivative, thereby assembling branched m-PEG 2. As one — 

10 of skill will appreciate, the tosylate leaving group can be replaced with any suitable leaving 
group, e.g., halogen, mesylate, triflate, etc. Similarly, the reactive carbonate utilized to 
acylate the amine can be replaced with an active ester, e.g., N-hydroxysuccinimide, etc., or 
the acid can be activated in situ using a dehydrating agent such as dicyclohexylcarbodiimide, 
carbonyldiimidazole, etc. 

1 5 [0279] In an exemplary embodiment, the modifying group is a PEG moiety, however, any 
modifying group, e.g., water-soluble polymer, water-insoluble polymer, therapeutic moiety, 
etc., can be incorporated in a glycosyl moiety through an appropriate linkage. The modified 
sugar is formed by enzymatic means, chemical means or a combination thereof, thereby 
producing a modified sugar. In an exemplary embodiment, the sugars are substituted with an 

20 active amine at any position that allows for the attachment of the modifying moiety, yet still 
allows the sugar to function as a substrate for an enzyme capable of coupling the modified 
sugar to the G-CSF peptide. In an exemplary embodiment, when galactosamine is the 
modified sugar, the amine moiety is attached to the carbon atom at the 6-position. 
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Water-soluble Polymer Modified Species 

[0280] Water-soluble polymer modified nucleotide sugar species in which the sugar moiety 
is modified with a water-soluble polymer are of use in the present invention. An exemplary 
modified sugar nucleotide bears a sugar group that is modified through an amine moiety on 
5 the sugar. Modified sugar nucleotides, e.g., saccharyl-amine derivatives of a sugar 

nucleotide, are also of use in the methods of the invention. For example, a saccharyl amine 
(without the modifying group) can be enzymatically conjugated to a peptide (or other 
species) and the free saccharyl amine moiety subsequently conjugated to a desired modifying 
group. Alternatively, the modified sugar nucleotide can function as a substrate for an enzyme 
1 0 that transfers the modified sugar to a saccharyl acceptor on a substrate, e.g., a peptide, 
glycopeptide, lipid, aglycone, glycolipid, etc. 

[0281] In one embodiment in which the saccharide core is galactose or glucose, R 5 is 
NHC(0)Y. 

[0282] In an exemplary embodiment, the modified sugar is based upon a 6-amino-N-acetyl- 
1 5 glycosyl moiety. As shown below for N-acetylgalactosamine, the 6-amino-sugar moiety is 
readily prepared by standard methods. 




[0283] In the scheme above, the index n represents an integer from 1 to 2500, preferably 
from 10 to 1500, and more preferably from 10 to 1200. The symbol "A 55 represents an 
20 activating group, e.g. , a halo, a component of an activated ester (e.g. , a N- 

hydroxysuccinimide ester), a component of a carbonate (e.g., p-nitrophenyl carbonate) and 

■ 
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the like. Those of skill in the art will appreciate that other PEG-amide nucleotide sugars are 
readily prepared by this and analogous methods. 

[0284] In other exemplary embodiments, the amide moiety is replaced by a group such as a 
urethane or a urea. 

[0285] In still further embodiments, R 1 is a branched PEG, for example, one of those species 
set forth above. Illustrative compounds according to this embodiment include: 



CH(OH)CH(OH)CH 2 OH 



NHC(0)(CH 2 ) a NHC(0)X 4 (CH 2 ) b (OCH 2 CH 2 ) c O(CH 2 ) d N 





J (CH 2 CH 2 0) e CH 3 

NHC(0)X 4 CH 2 CH 2 (OCH 2 CH 2 ),OCH ; 




CH(OH)CH(OH)CH 2 OH 

O 



NHC(0)(CH 2 ) a NH 




•J (CH 2 CH 2 0) 0 CH 3 

NHC(0)X 4 CH 2 CH 2 (OCH 2 CH 2 ),OCH 3 





HOOC. .O. ^CH(OH)CH(OH)CH 2 NH^ (CH 2 CH 2 0) e CH 3 

HO I I NHC(0)X 4 CH 2 CH 2 (OCH 2 CH 2 ) f OCH 3 

NHC(0)CH 3 

OH 



CH(OH)CH(OH)CH 2 NHC(0)0(CH 2 ) b (OCH 2 CH2) c O(CH 2 ) d N 





V) (CH 2 CH z O) e CH 3 

NHC(0)X 4 CH 2 CH 2 (OCH 2 CH 2 ),OCH 3 



NHC(0)CH 3 




CH(OH)CH(OH)CH 2 NHC(0)X 4 (CH 2 ) b (OCH 2 CH 2 ) c O(CH 2 ) d NH 



NHC(0)CH : 




'J (CH 2 CH 2 0) o CH 3 

NHC(0)X 4 CH 2 CH 2 (OCH 2 CH 2 ) f OCH 3 



in which X 4 is a bond or O, and J is S or O. 

[0286] Moreover, as discussed above, the present invention provides peptide conjugates that 
are formed using nucleotide sugars that are modified with a water-soluble polymer, which is 
either straight-chain or branched. For example, compounds having the formula shown below 
are within the scope of the present invention: 
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in which X is O or a bond, and Jis S or O. 

[0287] Similarly, the invention provides peptide conjugates that are formed using nucleotide 

sugars of those modified sugar species in which the carbon at the 6-position is modified: 

O 




in which X 4 is a bond or O, J is S or O, and y is 0 or 1 . 



[0288] Also provided are conjugates of peptides and glycopeptides, lipids and glycolipids 
that include the compositions of the invention. For example, the invention provides 
conjugates having the following formulae: 
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HOOC 




HOOC 



CH(OH)CH(OH)CH 2 OH 



NHC(0)(CH 2 ) a NHC(0)X 4 (CH 2 ) b (OCH 2 CH 2 ) e O(CH 2 ) d N 





CH(OH)CH(OH)CH 2 OH 

O 



NHC(0)(CH 2 ) a NH 




J (CH 2 CH 2 0)„CH 3 

NHC(0)CH 2 CH a (OCH 2 CH 2 ) f OCH 3 



•J (CH 2 CH 2 0) 8 CH a 

NHC(0)CH 2 CH 2 (OCH 2 CH 2 ) ( OCH 3 



HOOC 




CH(OH)CH(OH)CH 2 NH 



NHC(0)CH 3 




J (CH 2 CH 2 0).CH 3 

NHC(0)CH 2 CH 2 (OCH 2 CH 2 ),OCH 3 



and 



HOOC 





O^ ^CH(OH)CH(OH)CH 2 NHC(0)X 4 (CH 2 ) b (OCH 2 CH 2 ) c O(CH 2 ) d NH^ Y ^ (CH 2 CH 2 0) e CH 3 

NHC(0)CH 2 CH 2 (OCH 2 CH 2 ),OCH 3 

NHC(0)CH 3 



wherein J s S or O. 
Water-insoluble polymers 

[0289] In another embodiment, analogous to those discussed above, the modified sugars 
include a water-insoluble polymer, rather than a water-soluble polymer. The conjugates of 
the invention may also include one or more water-insoluble polymers. This embodiment of 
the invention is illustrated by the use of the conjugate as a vehicle with which to deliver a 
therapeutic peptide in a controlled manner. Polymeric drug delivery systems are known in 
the art. See, for example, Dunn et at , Eds. Polymeric Drugs And Drug Delivery 
Systems, ACS Symposium Series Vol. 469, American Chemical Society, Washington, D.C. 
1991. Those of skill in the art will appreciate that substantially any known drug delivery 
system is applicable to the conjugates of the present invention. 
[0290] Representative water-insoluble polymers include, but are not limited to, 
polyphosphazines, polyvinyl alcohols), polyamides, polycarbonates, polyalkylenes, 
polyacrylamides, polyalkylene glycols, polyalkylene oxides, polyalkylene terephthalates, 
polyvinyl ethers, polyvinyl esters, polyvinyl halides, polyvinylpyrrolidone, polyglycolides, 
polysiloxanes, polyurethanes, poly(methyl methacrylate), poly(ethyl methacrylate), 
poly(butyl methacrylate), poly(isobutyl methacrylate), poly(hexyl methacrylate), 
poly(isodecyl methacrylate), poly(lauryl methacrylate), poly(phenyl methacrylate), 
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poly(methyl acrylate), poly(isopropyl acrylate), poly(isobutyl acrylate), poly(octadecyl 
acrylate) polyethylene, polypropylene, poly(ethylene glycol), poly(ethylene oxide), poly 
(ethylene terephthalate), polyvinyl acetate), polyvinyl chloride, polystyrene, polyvinyl 
pyrrolidone, pluronics and polyvinylphenol and copolymers thereof. 
5 [0291] Synthetically modified natural polymers of use in conjugates of the invention include, 
but are not limited to, alkyl celluloses, hydroxyalkyl celluloses, cellulose ethers, cellulose 
esters, and nitrocelluloses. Particularly preferred members of the broad classes of 
synthetically modified natural polymers include, but are not limited to, methyl cellulose, 
ethyl cellulose, hydroxypropyl cellulose, hydroxypropyl methyl cellulose, hydroxybutyl 

10 methyl cellulose, cellulose acetate, cellulose propionate, cellulose acetate butyrate, cellulose 
acetate phthalate, carboxymethyl cellulose, cellulose triacetate, cellulose sulfate sodium salt, 
and polymers of acrylic and methacrylic esters and alginic acid. 
[0292] These and the other polymers discussed herein can be readily obtained from 
commercial sources such as Sigma Chemical Co. (St. Louis, MO.), Polysciences (Warrenton, 

15 PA.), Aldrich (Milwaukee, WL), Fluka (Ronkonkoma, NY), and BioRad (Richmond, CA), or 
else synthesized from monomers obtained from these suppliers using standard techniques. 
[0293] Representative biodegradable polymers of use in the conjugates of the invention 
include, but are not limited to, polylactides, polyglycolides and copolymers thereof, 
poly(ethylene terephthalate), poly(butyric acid), poly(valeric acid), poly(lactide-co- 

20 caprolactone), poly(lactide-co-glycolide), polyanhydrides, polyorthoesters, blends and 
copolymers thereof. Of particular use are compositions that form gels, such as those 
including collagen, pluronics and the like. 

[0294] The polymers of use in the invention include "hybrid' polymers that include water- 
insoluble materials having within at least a portion of their structure, a bioresorbable 
25 molecule. An example of such a polymer is one that includes a water-insoluble copolymer, 
which has a bioresorbable region, a hydrophilic region and a plurality of crosslinkable 
functional groups per polymer chain. 

[0295] For purposes of the present invention, "water-insoluble materials" includes materials 
that are substantially insoluble in water or water-containing environments. Thus, although 
30 certain regions or segments of the copolymer may be hydrophilic or even water-soluble, the 
polymer molecule, as a whole, does not to any substantial measure dissolve in water. 
[0296] For purposes of the present invention, the term "bioresorbable molecule" includes a 
region that is capable of being metabolized or broken down and resorbed and/or eliminated 
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through normal excretory routes by the body. Such metabolites or break down products are 
preferably substantially non-toxic to the body. 

[0297] The bioresorbable region may be either hydrophobic or hydrophilic, so long as the 
copolymer composition as a whole is not rendered water-soluble. Thus, the bioresorbable 
region is selected based on the preference that the polymer, as a whole, remains water- 
insoluble. Accordingly, the relative properties, i.e., the kinds of functional groups contained 
by, and the relative proportions of the bioresorbable region, and the hydrophilic region are 
selected to ensure that useful bioresorbable compositions remain water-insoluble. 
[0298] Exemplary resorbable polymers include, for example, synthetically produced 
resorbable block copolymers of poly(a-hydroxy-carboxylic acid)/poly(oxyalkylene, (see, 
Cohn et ah, U.S. Patent No. 4,826,945). These copolymers are not crosslinked and are water- 
soluble so that the body can excrete the degraded block copolymer compositions. See, 
Younes et ah, J Biomed Mater. Res. 21: 1301-1316 (1987); and Cohn et ah, JBiomed 
Mater. Res. 22: 993-1009 (1988). 

[0299] Presently preferred bioresorbable polymers include one or more components selected 
from poly(esters), poly(hydroxy acids), poly(lactones), poly(amides), poly(ester-amides), 
poly (amino acids), poly(anhydrides), poly(orthoesters), poly(carbonates), 
poly(phosphazines), poly(phosphoesters), poly(thioesters), polysaccharides and mixtures 
thereof. More preferably still, the bioresorbable polymer includes a poly(hydroxy) acid 
component. Of the poly(hydroxy) acids, polylactic acid, polyglycolic acid, polycaproic acid, 
polybutyric acid, polyvaleric acid and copolymers and mixtures thereof are preferred. 
[0300] In addition to forming fragments that are absorbed in vivo ("bioresorbed"), preferred 
polymeric coatings for use in the methods of the invention can also form an excretable and/or 
metabolizable fragment. 

[0301] Higher order copolymers can also be used in the present invention. For example, 
Casey et ah, U.S. Patent No. 4,438,253, which issued on March 20, 1984, discloses tri-block 
copolymers produced from the transesterification of poly(glycolic acid) and an hydroxyl- 
ended poly(alkylene glycol). Such compositions are disclosed for use as resorbable 
monofilament sutures. The flexibility of such compositions is controlled by the incorporation 
of an aromatic orthocarbonate, such as tetra-p-tolyl orthocarbonate into the copolymer 
structure. 

[0302] Other polymers based on lactic and/or glycolic acids can also be utilized. For 
example, Spinu, U.S. Patent No. 5,202,413, which issued on April 13, 1993, discloses 
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biodegradable multi-block copolymers having sequentially ordered blocks of polylactide 
and/or polyglycolide produced by ring-opening polymerization of lactide and/or glycolide 
onto either an oligomeric diol or a diamine residue followed by chain extension with a di- . 
functional compound, such as, a diisocyanate, diacylchloride or dichlorosilane. 
[0303] Bioresorbable regions of coatings useful in the present invention can be designed to 
be hydrolytically and/or enzymatically cleavable. For purposes of the present invention, 
"hydrolytically cleavable" refers to the susceptibility of the copolymer, especially the 
bioresorbable region, to hydrolysis in water or a water-containing environment. Similarly, 
"enzymatically cleavable" as used herein refers to the susceptibility of the copolymer, 
especially the bioresorbable region, to cleavage by endogenous or exogenous enzymes. 
[0304] When placed within the body, the hydrophilic region can be processed into excretable 
and/or metabolizable fragments. Thus, the hydrophilic region can include, for example, 
polyethers, polyalkylene oxides, polyols, polyvinyl pyrrolidine), polyvinyl alcohol), 
poly(alkyl oxazolines), polysaccharides, carbohydrates, peptides, proteins and copolymers 
and mixtures thereof. Furthermore, the hydrophilic region can also be, for example, a 
poly(alkylene) oxide. Such poly(alkylene) oxides can include, for example, poly(ethylene) 
oxide, poly(propylene) oxide and mixtures and copolymers thereof. 

[0305] Polymers that are components of hydrogels are also useful in the present invention. 
Hydrogels are polymeric materials that are capable of absorbing relatively large quantities of 
water. Examples of hydrogel forming compounds include, but are not limited to, polyacrylic 
acids, sodium carboxymethylcellulose, polyvinyl alcohol, polyvinyl pyrrolidine, gelatin, 
carrageenan and other polysaccharides, hydroxyethylenemethacrylic acid (HEMA), as well as 
derivatives thereof, and the like. Hydrogels can be produced that are stable, biodegradable 1 
and bioresorbable. Moreover, hydrogel compositions can include subunits that exhibit one or 
more of these properties. 

[0306] Bio-compatible hydrogel compositions whose integrity can be controlled through 
crosslinking are known and are presently preferred for use in the methods of the invention. 
For example, Hubbell et a/., U.S. Patent Nos. 5,410,016, which issued on April 25, 1995 and 
5,529,914, which issued on June 25, 1996, disclose water-soluble systems, which are 
crosslinked block copolymers having a water-soluble central block segment sandwiched 
between two hydrolytically labile extensions. Such copolymers are further end-capped with 
photopolymerizable acrylate functionalities. When crosslinked, these systems become 
hydrogels. The water soluble central block of such copolymers can include poly(ethylene 
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glycol); whereas, the hydrolytically labile extensions can be a poly(oc~hydroxy acid), such as 
polyglycolic acid or polylactic acid. See, Sawhney et ah, Macromolecules 26: 581-587 
(1993). 

[0307] In another embodiment, the gel is a thermoreversible gel. Thermoreversible gels 
5 including components, such as pluronics, collagen, gelatin, hyaluronic acid, 

polysaccharides, polyurethane hydrogel, polyurethane-urea hydrogel and combinations 
thereof are presently preferred. 

[0308] In yet another exemplary embodiment, the conjugate of the invention includes a 
component of a liposome. Liposomes can be prepared according to methods known to those 

10 skilled in the art, for example, as described in Eppstein et ah, U.S. Patent No. 4,522,81 1, 
which issued on June 1 1, 1985. For example, liposome formulations ma^ be prepared by 
dissolving appropriate lipid(s) (such as stearoyl phosphatidyl ethanolamine, stearoyl 
phosphatidyl choline, arachadoyl phosphatidyl choline, and cholesterol) in an inorganic 
solvent that is then evaporated, leaving behind a thin film of dried lipid on the surface of the 

1 5 container. An aqueous solution of the active compound or its pharmaceutical^ acceptable 
salt is then introduced into the container. The container is then swirled by hand to free lipid 
material from the sides of the container and to disperse lipid aggregates, thereby forming the 
liposomal suspension. 

[0309] The above-recited microparticles and methods of preparing the microparticles are 
20 offered by way of example and they are not intended to define the scope of microparticles of 
use in the present invention. It will be apparent to those of skill in the art that an array of 
microparticles, fabricated by different methods, are of use in the present invention. 
[0310] The structural formats discussed above in the context of the water-soluble polymers, 
both straight-chain and branched are generally applicable with respect to the water-insoluble 
25 polymers as well. Thus, for example, the cysteine, serine, dilysine, and trilysine branching 
cores can be functionalized with two water-insoluble polymer moieties. The methods used to 
produce these species are generally closely analogous to those used to produce the water- 
soluble polymers. 

[0311] The in vivo half-life of therapeutic glycopeptides can also be enhanced with PEG 
30 moieties such as polyethylene glycol (PEG). For example, chemical modification of proteins 
with PEG (PEGylation) increases their molecular size and decreases their surface- and 
functional group-accessibility, each of which are dependent on the size of the PEG attached 
to the protein. This results in an improvement of plasma half-lives and in proteolytic- 
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stability, and a decrease in immunogenicity and hepatic uptake (Chaffee et al J. Clin. Invest. 
89: 1643-1651 (1992); Pyatak*tfa/. Res. Commun. Chem. Pathol Pharmacol 29: 113-127 
(1980)). PEGylation of interleukin-2 has been reported to increase its antitumor potency in 
vivo (Katre et al Proc. Natl Acad. Sci. USA. 84: 1487-1491 (1987)) and PEGylation of a 
5 F(ab')2 derived from the monoclonal antibody A7 has improved its tumor localization 

(Kitamura et al Biochem. Biophys. Res. Commun. 28: 1387-1394 (1990)). Thus, in another 
embodiment, the in vivo half-life of a peptide derivatized with a PEG moiety by a method of 
the invention is increased relevant to the in vivo half-life of the non-derivatized peptide. 
[0312] The increase in peptide in vivo half-life is best expressed as a range of percent 
1 0 increase in this quantity. The lower end of the range of percent increase is about 40%, about 
60%, about 80%, about 100%, about 150%or about 200%. The upper end of the range is 
about 60%, about 80%, about 100%, about 150%, or more than about 250%. 

Biomolecules 

[0313] In another embodiment, the modified sugar bears a biomolecule. In still further 
1 5 embodiments, the biomolecule is a functional protein, enzyme, antigen, antibody, peptide, 
nucleic acid (e.g., single nucleotides or nucleosides, oligonucleotides, polynucleotides and 
single- and higher-stranded nucleic acids), lectin, receptor or a combination thereof. 

[0314] Preferred biomolecules are essentially non-fluorescent, or emit such a minimal 
amount of fluorescence that they are inappropriate for use as a fluorescent marker in an assay. 

20 Moreover, it is generally preferred to use biomolecules that are not sugars. An exception to 
this preference is the use of an otherwise naturally occurring sugar that is modified by 
covalent attachment of another entity (e.g., PEG, biomolecule, therapeutic moiety, diagnostic 
moiety, etc.). In an exemplary embodiment, a sugar moiety, which is a biomolecule, is 
conjugated to a linker arm and the sugar-linker arm cassette is subsequently conjugated to a 

25 peptide via a method of the invention. 

[0315] Biomolecules useful in practicing the present invention can be derived from any 
source. The biomolecules can be isolated from natural sources or they can be produced by 
synthetic methods. Peptides can be natural peptides or mutated peptides. Mutations can be 
effected by chemical mutagenesis, site-directed mutagenesis or other means of inducing 
30 mutations known to those of skill in the art. Peptides useful in practicing the instant 

invention include, for example, enzymes, antigens, antibodies and receptors. Antibodies can 
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be either polyclonal or monoclonal; either intact or fragments. The peptides are optionally 
the products of a program of directed evolution 

[0316] Both naturally derived and synthetic peptides and nucleic acids are of use in 
conjunction with the present invention; these molecules can be attached to a sugar residue 
5 component or a crosslinking agent by any available reactive group. For example, peptides 
can be attached through a reactive amine, carboxyl, sulfhydryl, or hydroxyl group. The 
reactive group can reside at a peptide terminus or at a site internal to the peptide chain. 
Nucleic acids can be attached through a reactive group on a base (e.g., exocyclic amine) or an 

« 

available hydroxyl group on a sugar moiety (e.g., 3'- or 5 '-hydroxyl). The peptide and 
1 0 nucleic acid chains can be further derivatized at one or more sites to allow for the attachment 
of appropriate reactive groups onto the chain. See, Chrisey et al Nucleic Acids Res. 24: 
3031-3039 (1996). 

[0317] In a further embodiment, the biomolecule is selected to direct the peptide modified 
by the methods of the invention to a specific tissue, thereby enhancing the delivery of the 

1 5 peptide to that tissue relative to the amount of underivatized peptide that is delivered to the 

tissue. In a still further embodiment, the amount of derivatized peptide delivered to a specific 
tissue within a selected time period is enhanced by derivatization by at least about 20%, more 
preferably, at least about 40%, and more preferably still, at least about 100%. Presently, 
preferred biomolecules for targeting applications include antibodies, hormones and ligands 

20 for cell-surface receptors. 

[0318] In still a further exemplary embodiment, there is provided as conjugate with biotin. 
Thus, for example, a selectively biotinylated peptide is elaborated by the attachment of an 
avidin or streptavidin moiety bearing one or more modifying groups. 

Therapeutic Moieties 

25 [0319] In another embodiment, the modified sugar includes a therapeutic moiety. Those of 
skill in the art will appreciate that there is overlap between the category of therapeutic 
moieties and biomolecules; many biomolecules have therapeutic properties or potential. 

[0320] The therapeutic moieties can be agents already accepted for clinical use or they can 
be drugs whose use is experimental, or whose activity or mechanism of action is under 
30 investigation. The therapeutic moieties can have a proven action in a given disease state or 
can be only hypothesized to show desirable action in a given disease state. In another 
embodiment, the therapeutic moieties are compounds, which are being screened for their 

84 



WO 2005/070138 PCT/US2005/000799 

ability to interact with a tissue of choice. Therapeutic moieties, which are useful in practicing 
the instant invention include drugs from a broad range of drug classes having a variety of 
pharmacological activities. Preferred therapeutic moieties are essentially non-fluorescent, or 
emit such a minimal amount of fluorescence that they are inappropriate for use as a 
5 fluorescent marker in an assay. Moreover, it is generally preferred to use therapeutic 
moieties that are not sugars. An exception to this preference is the use of a sugar that is 
modified by covalent attachment of another entity, such as a PEG, biomolecule, therapeutic 
moiety, diagnostic moiety and the like. In another exemplary embodiment, a therapeutic 
sugar moiety is conjugated to a linker arm and the sugar-linker arm cassette is subsequently 
1 0 conjugated to a peptide via a method of the invention. 

[0321] Methods of conjugating therapeutic and diagnostic agents to various other species 
are well known to those of skill in the art. See, for example Hermanson, BlOCONJUGATE 
Techniques, Academic Press, San Diego, 1996; and Dunn et ah, Eds. Polymeric Drugs 
And Drug Delivery Systems, ACS Symposium Series Vol. 469, American Chemical 
1 5 Society, Washington, D.C. 1 991 . 

[0322] In an exemplary embodiment, the therapeutic moiety is attached to the modified 
sugar via a linkage that is cleaved under selected conditions. Exemplary conditions include, 
but are not limited to, a selected pH (e.g., stomach, intestine, endocytotic vacuole), the 
presence of an active enzyme (e.g, esterase, reductase, oxidase), light, heat and the like. 
20 Many cleavable groups are known in the art. See, for example, Jung et ah, Biochem. 

Biophys. Acta, 761: 152-162 (1983); Joshi et ah, J. Biol. Chem., 265: 14518-14525 (1990); 
Zarling et ah, J. Immunol., 124: 913-920 (1980); Bouizar et ah, Eur. J. Biochem., 155: 141- 
147 (1986); Park et ah, J. Biol. Chem., 261: 205-210 (1986); Browning et ah, J. Immunol, 
143: 1859-1867 (1989). 

25 [0323] Classes of useful therapeutic moieties include, for example, non-steroidal anti- 
inflammatory drugs (NSAIDS). The NSAIDS can, for example, be selected from the 
following categories: {e.g., propionic acid derivatives, acetic acid derivatives, fenamic acid 
derivatives, biphenylcarboxylic acid derivatives and oxicams); steroidal anti-inflammatory 
drugs including hydrocortisone and the like; antihistaminic drugs (e.g., chlorpheniramine, 

30 triprolidine); antitussive drugs (e.g., dextromethorphan, codeine, caramiphen and 

carbetapentane); antipruritic drugs (e.g., methdilazine and trimeprazine); anticholinergic 
drugs (e.g., scopolamine, atropine, homatropine, levodopa); anti-emetic and antinauseant 
drugs (e.g., cyclizine, meclizine, chlorpromazine, buclizine); anorexic drugs (e.g., 
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benzphetamine, phentermine, cWorphentermine, fenfluramine); central stimulant drugs (e.g., 
amphetamine, methamphetamine, dextroamphetamine and methylphenidate); antiarrhythmic 
d^gs (e.g., propanolol, procainamide, disopyramide, quinidine, encainide); p-adrenergic 
blocker drugs (e.g., metoprolol, acebutolol, betaxolol, labetalol and timolol); cardiotonic 
drugs (e.g., milrinone 5 amrinone and dobutamine); antihypertensive drugs (e.g., enalapril, 
clonidine, hydralazine, minoxidil, guanadrel, guanethidine);diuretic drugs (e.g., amiloride and 
hydrochlorothiazide); vasodilator drugs (e.g., diltiazem, amiodarone, isoxsuprine, nylidrin, 
tolazoline and verapamil); vasoconstrictor drugs (e.g., dihydroergotamine, ergotamine and 
methylsergide); antiulcer drugs (e.g., ranitidine and cimetidine); anesthetic drugs (e.g., 
lidocaine, bupivacaine, chloroprocaine, dibucaine); antidepressant drugs (e.g., imipramine, 
desipramine, amitryptiline, nortryptiline); tranquilizer and sedative drugs (e.g., 
chlordiazepoxide, benacytyzine, benzquinamide, flurazepam, hydroxyzine, loxapine and 
promazine); antipsychotic drugs (e.g., chlorprothixene, fluphenazine, haloperidol, molindone, 
thioridazine and trifluoperazine); antimicrobial drugs (antibacterial, antifungal, antiprotozoal 
and antiviral drugs). 

[0324] Antimicrobial drugs which are preferred for incorporation into the present 
composition include, for example, pharmaceutical^ acceptable salts of p -lactam drugs, 
quinolone drugs, ciprofloxacin, norfloxacin, tetracycline, erythromycin, amikacin, triclosan, 
doxycycline, capreomycin, chlorhexidine, chlortetracycline, oxytetracycline, clindamycin, 
ethambutol, hexamidine isothionate, metronidazole, pentamidine, gentamycin, kanamycin, 
lineomycin, methacycline, methenamine, minocycline, neomycin, netilmycin, paromomycin, 
streptomycin, tobramycin, miconazole and amantadine. 

[0325] Other drug moieties of use in practicing the present invention include antineoplastic 
drugs (e.g., antiandrogens (e.g., leuprolide or flutamide), cytocidal agents (e.g., adriamycin, 
doxorubicin, taxol, cyclophosphamide, busulfan, cisplatin, P-2-interferon) anti-estrogens 
(e.g., tamoxifen), antimetabolites (e.g., fluorouracil, methotrexate, mercaptopurine, 
thioguanine). Also included within this class are radioisotope-based agents for both 
diagnosis and therapy, and conjugated toxins, such as ricin, geldanamycin, mytansin, CC- 
1065, the duocarmycins, Chlicheamycin and related structures and analogues thereof. 

[0326] The therapeutic moiety can also be a hormone (e.g., medroxyprogesterone, 
estradiol, leuprolide, megestrol, octreotide or somatostatin); muscle relaxant drugs (e.g., 
cinnamedrine, cyclobenzaprine, flavoxate, orphenadrine, papaverine, mebeverine, idaverine, 
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ritodrine, diphenoxylate, dantrolene and azumolen); antispasmodic drugs; bone-active drugs 
{e.g., diphosphonate and phosphonoalkylphosphinate drug compounds); endocrine 
modulating drugs {e.g., contraceptives {e.g., ethinodiol, ethinyl estradiol, norethindrone, 
mestranol, desogestrel, medroxyprogesterone), modulators of diabetes {e.g., glyburide or 
5 chlorpropamide), anabolics, such as testolactone or stanozolol, androgens {e.g., 

methyltestosterone, testosterone or fluoxymesterone), antidiuretics {e.g., desmopressin) and 
calcitonins). 

[0327] Also of use in the present invention are estrogens {e.g., diethylstilbesterol), 
glucocorticoids {e.g. , triamcinolone, betamethasone, etc.) and progestogens, such as 
10 norethindrone, ethynodiol, norethindrone, levonorgestrel; thyroid agents {e.g., liothyronine or 
levothyroxine) or anti-thyroid agents {e.g., methimazole); antihyperprolactinemic drugs {e.g., 
cabergoline); hormone suppressors {e.g., danazol or goserelin), oxytocics {e.g., 
methylergonovine or oxytocin) and prostaglandins, such as mioprostol, alprostadil or 
dinoprostone, can also be employed. 

1 5 [0328] Other useful modifying groups include immunomodulating drugs {e.g. , 

antihistamines, mast cell stabilizers, such as lodoxamide and/or cromolyn, steroids {e.g., 
triamcinolone, beclomethazone, cortisone, dexamethasone, prednisolone, 
methylprednisolone, beclomethasone, or clobetasol), histamine H2 antagonists {e.g., 
famotidine, cimetidine, ranitidine), immunosuppressants {e.g., azathioprine, cyclosporin), etc. 

20 Groups with anti-inflammatory activity, such as sulindac, etodolac, ketoprofen and ketorolac, 
are also of use. Other drugs of use in conjunction with the present invention will be apparent 
to those of skill in the art. 

Preparation of Modified Sugars 

[0329] In general, the sugar moiety and the modifying group are linked together through 
25 the use of reactive groups, which are typically transformed by the linking process into a new 
organic functional group or unreactive species. The sugar reactive functional group(s), is 
located at any position on the sugar moiety. Reactive groups and classes of reactions useful 
in practicing the present invention are generally those that are well known in the art of 
bioconjugate chemistry. Currently favored classes of reactions available with reactive sugar 
30 moieties are those, which proceed under relatively mild conditions. These include, but are 
not limited to nucleophilic substitutions {e.g., reactions of amines and alcohols with acyl 
halides, active esters), electrophilic substitutions {e.g., enamine reactions) and additions to 
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carbon-carbon and carbon-heteroatom multiple bonds (e.g., Michael reaction, Diels- Alder 
addition). These and other useful reactions are discussed in, for example, March, Advanced 
Organic Chemistry, 3rd Ed., John Wiley & Sons, New York, 1985; Hermanson, 
Bioconjugate Techniques, Academic Press, San Diego, 1996; and Feeney et al, 
5 Modification of Proteins; Advances in Chemistry Series, Vol. 198, American Chemical 
Society, Washington, D.C., 1982. 

[0330] Useful reactive functional groups pendent from a sugar nucleus or modifying group 
include, but are not limited to: 

(a) carboxyl groups and various derivatives thereof including, but not limited to, 

1 0 N-hydroxysuccinimide esters, N-hydroxybenztriazole esters, acid halides, acyl 

imidazoles, thioesters, p-nitrophenyl esters, alkyl, alkenyl, alkynyl and 
aromatic esters; 

(b) hydroxyl groups, which can be converted to, e.g., esters, ethers, aldehydes, etc. 

(c) haloalkyl groups, wherein the halide can be later displaced with a nucleophilic 
1 5 group such as, for example, an amine, a carboxylate anion, thiol anion, 

carbanion, or an alkoxide ion, thereby resulting in the covalent attachment of a 
new group at the functional group of the halogen atom; 

(d) dienophile groups, which are capable of participating in Diels- Alder reactions 

such as, for example, maleimido groups; 

20 (e) aldehyde or ketone groups, such that subsequent derivatization is possible via 

formation of carbonyl derivatives such as, for example, imines, hydrazones, 
semicarbazones or oximes, or via such mechanisms as Grignard addition or 
alkyllithium addition; 

(f) sulfonyl halide groups for subsequent reaction with amines, for example, to form 
25 sulfonamides; 

(g) thiol groups, which can be, for example, converted to disulfides or reacted with 

acyl halides ; 

(h) amine or sulfhydryl groups, which can be, for example, acylated, alkylated or 

oxidized; 

30 (i) alkenes, which can undergo, for example, cycloadditions, acylation, Michael 

addition, etc; and 
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(j) epoxides, which can react with, for example, amines and hydroxyl compounds. 

[0331] The reactive functional groups can be chosen such that they do not participate in, or 
interfere with, the reactions necessary to assemble the reactive sugar nucleus or modifying 
group. Alternatively, a reactive functional group can be protected from participating in the 
5 reaction by the presence of a protecting group. Those of skill in the art understand how to 
protect a particular functional group such that it does not interfere with a chosen set of 
reaction conditions. For examples of useful protecting groups, see, for example, Greene et 
al, Protective Groups in Organic Synthesis, John Wiley & Sons, New York, 1991. 

[0332] In the discussion that follows, a number of specific examples of modified sugars 
1 0 that are useful in practicing the present invention are set forth. In the exemplary 

embodiments, a sialic acid derivative is utilized as the sugar nucleus to which the modifying 
group is attached. The focus of the discussion on sialic acid derivatives is for clarity of 
illustration only and should not be construed to limit the scope of the invention. Those of 
skill in the art will appreciate that a variety of other sugar moieties can be activated and 
1 5 derivatized in a manner analogous to that set forth using sialic acid as an example. For 
example, numerous methods are available for modifying galactose, glucose, N- 
acetylgalactosamine and fiicose to name a few sugar substrates, which are readily modified 
by art recognized methods. See, for example, Elhalabi et al, Curr. Med. Chem. 6: 93 (1999); 
and Schafer et al, J. Org. Chem. 65: 24 (2000)). 

20 [0333] In an exemplary embodiment, the peptide that is modified by a method of the 
invention is a glycopeptide that is produced in prokaryotic cells {e.g., E. colt), eukaryotic 
cells including yeast and mammalian cells {e.g., CHO cells), or in a transgenic animal and 
thus contains N- and/or Olinked oligosaccharide chains, which are incompletely sialylated. 
The oligosaccharide chains of the glycopeptide lacking a sialic acid and containing a terminal 

25 galactose residue can be glyco-PEG-ylated, glyco-PPG-ylated or otherwise modified with a 
modified sialic acid. 

[0334] In Scheme 4, the amino glycoside 1, is treated with the active ester of a protected 
amino acid {e.g., glycine) derivative, converting the sugar amine residue into the 
corresponding protected amino acid amide adduct. The adduct is treated with an aldolase to 
30 form a-hydroxy carboxylate 2. Compound 2 is converted to the corresponding CMP 

derivative by the action of CMP-SA synthetase, followed by catalytic hydrogenation of the 
CMP derivative to produce compound 3. The amine introduced via formation of the glycine 
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adduct is utilized as a locus of PEG or PPG attachment by reacting compound 3 with an 
activated (m-) PEG or (m-) PPG derivative (e.g., PEG~C(0)NHS, PPG-C(O)NHS), 
producing 4 or 5, respectively. 



Scheme 4 
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[0335] Table 2 sets forth representative examples of sugar monophosphates that are 
derivatized with a PEG or PPG moiety. Certain of the compounds of Table 2 are prepared by 
the method of Scheme 4. Other derivatives are prepared by art-recognized methods. See, for 
example, Keppler et aL, Glycobiology 11: 1 1R (2001); and Charter et ah, Glycobiology 10: 
1049 (2000)). Other amine reactive PEG and PPG analogues are commercially available, or 
they can be prepared by methods readily accessible to those of skill in the art. 
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[0336] The modified sugar phosphates of use in practicing the present invention can be 
substituted in other positions as well as those set forth above. Presently preferred 
substitutions of sialic acid are set forth in Formula I: 

NH 2 



X-R 1 T o- + Na W 
t^r— OTVO'^Na HOOI 




OH 

00 



in which X is a linking group, which is preferably selected from -O-, -N(H)-, -S, CH 2 -, and - 
N(R) 2 , in which each R is a member independently selected from R^R 5 . The symbols Y, Z, 
A and B each represent a group that is selected from the group set forth above for the identity 
ofX. X,Y,Z,AandB are each independently selected and, therefore, they can be the same 
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or different. The symbols R 1 , R 2 , R 3 , R 4 and R 5 represent H, a water-soluble polymer, 
therapeutic moiety, biomolecule or other moiety. Alternatively, these symbols represent a 
linker that is bound to a water-soluble polymer, therapeutic moiety, biomolecule or other 
moiety. 

[0337] Exemplary moieties attached to the conjugates disclosed herein include, but are not 
limited to, PEG derivatives (e.g., alkyl-PEG, acyl-PEG, acyl-alkyl-PEG, alkyl-acyl-PEG 
carbamoyl-PEG, aryl-PEG), PPG derivatives (e.g., alkyl-PPG, acyl-PPG, acyl-alkyl-PPG, 
alkyl-acyl-PPG carbamoyl-PPG, aryl-PPG), therapeutic moieties, diagnostic moieties, 
mannose-6-phosphate, heparin, heparan, SLe x , marinose, mannose-6-phosphate, Sialyl Lewis 
X, FGF, VFGrF, proteins, chondroitin, keratan, dermatan, albumin, integrins, antennary 
oligosaccharides, peptides and the like. Methods of conjugating the various modifying 
groups to a saccharide moiety are readily accessible to those of skill in the art (Poly 
(Ethylene Glycol Chemistry : Biotechnical and Biomedical Applications, J. Milton 
Harris, Ed., Plenum Pub. Corp., 1992; Poly (Ethylene Glycol) Chemical and 
Biological Applications, J. Milton Harris, Ed., ACS Symposium Series No. 680, 
American Chemical Society, 1997; Hermanson, Bioconjugate Techniques, Academic 
Press, San Diego, 1996; and Dunn et al 9 Eds. Polymeric Drugs And Drug Delivery 
Systems, ACS Symposium Series Vol. 469, American Chemical Society, Washington, D.C. 
1991). 

Cross-linking Groups 

[0338] Preparation of the modified sugar for use in the methods of the present invention 
includes attachment of a modifying group to a sugar residue and forming a stable adduct, 
which is a substrate for a glycosyltransferase. The sugar and modifying group can be coupled 
by a zero- or higher-order cross-linking agent. Exemplary bifunctional compounds which 
can be used for attaching modifying groups to carbohydrate moieties include, but are not 
limited to, bifunctional poly(ethyleneglycols), polyamides, polyethers, polyesters and the 
like. General approaches for linking carbohydrates to other molecules are known in the 
literature. See, for example, Lee et al, Biochemistry 28: 1856 (1989); Bhatia et al, Anal 
Biochern. 178: 408 (1989); Janda et al, J. Am. Chem. Soc. 112: 8886 (1990) and Bednarski et 
al, WO 92/18135. In the discussion that follows, the reactive groups are treated as benign on 
the sugar moiety of the nascent modified sugar. The focus of the discussion is for clarity of 
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illustration. Those of skill in the art will appreciate that the discussion is relevant to reactive 
groups on the modifying group as well. 

[0339] An exemplary strategy involves incorporation of a protected sulfhydryl onto the 
sugar using the heterobifunctional crosslinker SPDP (n-succinimidyl-3-(2- 
5 pyridyldithio)propionate and then deprotecting the sulfhydryl for formation of a disulfide 
bond with another sulfhydryl on the modifying group. 

[0340] If SPDP detrimentally affects the ability of the modified sugar to act as a 
glycosyltransferase substrate, one of an array of other crosslinkers such as 2-iminothiolane or 
N-succinimidyl S-acetylthioacetate (SAT A) is used to form a disulfide bond. 2- 
1 0 iminothiolane reacts with primary amines, instantly incorporating an unprotected sulfhydryl 
onto the amine-containing molecule. SATA also reacts with primary amines, but 
incorporates a protected sulfhydryl, which is later deacetaylated using hydroxylamine to 
produce a free sulfhydryl. In each case, the incorporated sulfhydryl is free to react with other 
sulfhydryls or protected sulfhydryl, like SPDP, forming the required disulfide bond. 

1 5 [0341] The above-described strategy is exemplary, and not limiting, of linkers of use in the 
invention. Other crosslinkers are available that can be used in different strategies for 
crosslinking the modifying group to the peptide. For example, TPCH(S-(2-thiopyridyl)-L- 
cysteine hydrazide and TPMPH ((S-(2-thiopyridyl) mercapto-propionohydrazide) react with 
carbohydrate moieties that have been previously oxidized by mild periodate treatment, thus 

20 forming a hydrazone bond between the hydrazide portion of the crosslinker and the periodate 
generated aldehydes. TPCH and TPMPH introduce a 2-pyridylthione protected sulfhydryl 
group onto the sugar, which can be deprotected with DTT and then subsequently used for 
conjugation, such as forming disulfide bonds between components. 

[0342] If disulfide bonding is found unsuitable for producing stable modified sugars, other 
25 crosslinkers may be used that incorporate more stable bonds between components. The 
heterobifunctional crosslinkers GMBS (N-gama-malimidobutyryloxy)succinimide) and 
SMCC (succinimidyl 4-(N-maleimido-methyl)cyclohexane) react with primary amines, thus 
introducing a maleimide group onto the component. The maleimide group can subsequently 
react with sulfhydryls on the other component, which can be introduced by previously 
30 mentioned crosslinkers, thus forming a stable thioether bond between the components. If 

steric hindrance between components interferes with either component's activity or the ability 
of the modified sugar to act as a glycosyltransferase substrate, crosslinkers can be used which 
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introduce long spacer arms between components and include derivatives of some of the 
previously mentioned crosslinkers (i.e., SPDP). Thus, there is an abundance of suitable 
crosslinkers, which are useful; each of which is selected depending on the effects it has on 
optimal peptide conjugate and modified sugar production. 

[0343] A variety of reagents are used to modify the components of the modified sugar with 
intramolecular chemical crosslinks (for reviews of crosslinking reagents and crosslinking 
procedures see: Wold, R, Meth. Enzymol. 25: 623-651, 1972; Weetall, H. H., and Cooney, D. 
A., In: Enzymes as Drugs. (Holcenberg, and Roberts, eds.) pp. 395-442, Wiley, New York, 
1981; Ji, T. H., Meth. Enzymol. 91: 580-609, 1983; Mattson et al, Mol. Biol. Rep. 17: 167- 
183, 1993, all of which are incorporated herein by reference). Preferred crosslinking reagents 
are derived from various zero-length, homo-bifunctional, and hetero-bifunctional crosslinking 
reagents. Zero-length crosslinking reagents include direct conjugation of two intrinsic 
chemical groups with no introduction of extrinsic material. Agents that catalyze formation of 
a disulfide bond belong to this category. Another example is reagents that induce 
condensation of a carboxyl and a primary amino group to form an amide bond such as 
carbodiimides, ethylchloroformate, Woodward's reagent K (2-ethyl-5-phenylisoxazolium-3'- 
sulfonate), and carbonyldiimidazole. In addition to these chemical reagents, the enzyme 
transglutaminase (glutamyl-peptide y-glutamyltransferase; EC 2.3.2.13) may be used as zero- 
length crosslinking reagent. This enzyme catalyzes acyl transfer reactions at carboxamide 
groups of protein-bound glutaminyl residues, usually with a primary amino group as 
substrate. Preferred homo- and hetero-bifunctional reagents contain two identical or two 
dissimilar sites, respectively, which may be reactive for amino, sulfhydryl, guanidino, indole, 
or nonspecific groups. 



/. Preferred Specific Sites in Crosslinking Reagents 
1. Amino-Reactive Groups 

[0344] In one embodiment, the sites on the cross-linker are amino-reactive groups. Useful 
non-limiting examples of amino-reactive groups include N-hydroxysuccinimide (NHS) 
esters, imidoesters, isocyanates, acylhalides, arylazides, p-nitrophenyl esters, aldehydes, and 
sulfonyl chlorides. 

[0345] NHS esters react preferentially with the primary (including aromatic) amino groups 
of a modified sugar component. The imidazole groups of histidines are known to compete 
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with primary amines for reaction, but the reaction products are unstable and readily 
hydrolyzed. The reaction involves the nucleophilic attack of an amine on the acid carboxyl 
of an NHS ester to form an amide, releasing the N-hydroxysuccinimide. Thus, the positive 
charge of the original amino group is lost. 

5 [0346] Imidoesters are the most specific acylating reagents for reaction with the amine 

groups of the modified sugar components. At a pH between 7 and 10, imidoesters react only 
with primary amines. Primary amines attack imidates nucleophilically to produce an 
intermediate that breaks down to amidine at high pH or to a new imidate at low pH. The new 
imidate can react with another primary amine, thus crosslinking two amino groups, a case of 
10 a putatively mono functional imidate reacting bifunctionally. The principal product of 

reaction with primary amines is an amidine that is a stronger base than the original amine. 
The positive charge of the original amino group is therefore retained. 

[0347] Isocyanates (and isothiocyanates) react with the primary amines of the modified 
sugar components to form stable bonds. Their reactions with sulfhydryl, imidazole, and 
1 5 tyrosyl groups give relatively unstable products. 

[0348] Acylazides are also used as amino-specific reagents in which nucleophilic amines of 
the affinity component attack acidic carboxyl groups under slightly alkaline conditions, e.g. 
pH 8.5. 

[0349] Arylhalides such as l,5-difluoro-2,4-dinitrobenzene react preferentially with the 
20 amino groups and tyrosine phenolic groups of modified sugar components, but also with 
sulfhydryl and imidazole groups. 

[0350] p-Nitrophenyl esters of mono- and dicarboxylic acids are also useful amino-reactive 
groups. Although the reagent specificity is not very high, a- and e-amino groups appear to 
react most rapidly. 

25 [0351] Aldehydes such as glutaraldehyde react with primary amines of modified sugar. 
Although unstable Schiff bases are formed upon reaction of the amino groups with the 
aldehydes of the aldehydes, glutaraldehyde is capable of modifying the modified sugar with 
stable crosslinks. At pH 6-8, the pH of typical crosslinking conditions, the cyclic polymers 
undergo a dehydration to form a-P unsaturated aldehyde polymers. Schiff bases, however, 

30 are stable, when conjugated to another double bond. The resonant interaction of both double 
bonds prevents hydrolysis of the Schiff linkage. Furthermore, amines at high local 
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concentrations can attack the ethylenic double bond to form a stable Michael addition 
product. 

[0352] Aromatic sulfonyl chlorides react with a variety of sites of the modified sugar 
components, but reaction with the amino groups is the most important, resulting in a stable 
sulfonamide linkage. 

2. Sulfhydryl-Reactive Groups 

[0353] In another embodiment, the sites are sulfhydryl-reactive groups. Useful, non- 
limiting examples of sulfhydryl-reactive groups include maleimides, alkyl halides, pyridyl 
disulfides, and thiophthalimides. 

[0354] Maleimides react preferentially with the sulfhydryl group of the modified sugar 
components to form stable thioether bonds. They also react at a much slower rate with 
primary amino groups and the imidazole groups of histidines. However, at pH 7 the 
maleimide group can be considered a sulfhydryl-specific group, since at this pH the reaction 
rate of simple thiols is 1000-fold greater than that of the corresponding amine. 

[0355] Alkyl halides react with sulfhydryl groups, sulfides, imidazoles, and amino groups. 
At neutral to slightly alkaline pH, however, alkyl halides react primarily with sulfhydryl 
groups to form stable thioether bonds. At higher pH, reaction with amino groups is favored. 

[0356] Pyridyl disulfides react with free sulfhydryls via disulfide exchange to give mixed 
disulfides. As a result, pyridyl disulfides are the most specific sulfhydryl-reactive groups. 

[0357] Thiophthalimides react with free sulfhydryl groups to form disulfides. 

3. Carboxyl-Reactive Residue 

[0358] In another embodiment, carbodiimides soluble in both water and organic solvent, 
are used as carboxyl-reactive reagents. These compounds react with free carboxyl groups 
forming a pseudourea that can then couple to available amines yielding an amide linkage 
teach how to modify a carboxyl group with carbodiimde (Yamada et al, Biochemistry 20: 
4836-4842, 1981). 

it Preferred Nonspecific Sites in Crosslinking Reagents 

[0359] In addition to the use of site-specific reactive moieties, the present invention 
contemplates the use of non-specific reactive groups to link the sugar to the modifying group. 



96 



WO 2005/070138 PCT/US2005/000799 

[0360] Exemplary non-specific cross-linkers include photoactivatable groups, completely 
inert in the dark, which are converted to reactive species upon absorption of a photon of 
appropriate energy. In one embodiment, photoactivatable groups are selected from 
precursors of nitrenes generated upon heating or photolysis of azides. Electron-deficient 

■ 

5 nitrenes are extremely reactive and can react with a variety of chemical bonds including N-H, 
O-H, C-H, and C=C. Although three types of azides (aryl, alkyl, and acyl derivatives) may 
be employed, arylazides are presently. The reactivity of arylazides upon photolysis is better 
with N-H and O-H than C-H bonds. Electron-deficient arylnitrenes rapidly ring-expand to 
form dehydroazepines, which tend to react with nucleophiles, rather than form C-H insertion 

10 products. The reactivity of arylazides can be increased by the presence of electron- 
withdrawing substituents such as nitro or hydroxyl groups in the ring. Such substituents push 
the absorption maximum of arylazides to longer wavelength. Unsubstituted arylazides have 
an absorption maximum in the range of 260-280 nm, while hydroxy and nitroarylazides 
absorb significant light beyond 305 nm. Therefore, hydroxy and nitroarylazides are most 

1 5 , preferable since they allow to employ less harmful photolysis conditions for the affinity 
component than unsubstituted arylazides. 

[0361] In another preferred embodiment, photoactivatable groups are selected from 
fluorinated arylazides. The photolysis products of fluorinated arylazides are arylnitrenes, all 
of which undergo the characteristic reactions of this group, including C-H bond insertion, 
20 with high efficiency (Keana et al, J. Org. Chem. 55: 3640-3647, 1990). 

[0362] In another embodiment, photoactivatable groups are selected from benzophenone 
residues. Benzophenone reagents generally give higher crosslinking yields than arylazide 
reagents. 

[0363] In another embodiment, photoactivatable groups are selected from diazo 
25 compounds, which form an electron-deficient carbene upon photolysis. These carbenes 

undergo a variety of reactions including insertion into C-H bonds, addition to double bonds 
(including aromatic systems), hydrogen attraction and coordination to nucleophilic centers to 
give carbon ions. 

[0364] In still another embodiment, photoactivatable groups are selected from 
30 diazopyruvates. For example, the p-nitrophenyl ester of p-nitrophenyl diazopyruvate reacts 
with aliphatic amines to give diazopyruvic acid amides that undergo ultraviolet photolysis to 
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form aldehydes. The photolyzed diazopyruvate-modified affinity component will react like 
formaldehyde or glutaraldehyde forming crosslinks. 

iii. Homobifunctional Reagents 

1. Homobifunctional crosslinkers reactive with primary amines 

[0365] Synthesis, properties, and applications of amine-reactive cross-linkers are 
commercially described in the literature (for reviews of crosslinking procedures and reagents, 
see above). Many reagents are available {e.g., Pierce Chemical Company, Rockford, 111.; 
Sigma Chemical Company, St. Louis, Mo.; Molecular Probes, Inc., Eugene, OR.). 

[0366] Preferred, non-limiting examples of homobifunctional NHS esters include 
disuccinimidyl glutarate (DSG), disuccinimidyl suberate (DSS), bis(sulfosuccinimidyl) 
suberate (BS), disuccinimidyl tartarate (DST), disulfosuccinimidyl tartarate (sulfo-DST), bis- 
2-(succinimidooxycarbonyloxy)ethylsulfone(BSOCOES), bis-2-(sulfosuccinimidooxy- 
carbonyloxy)ethylsulfone (sulfo-BSOCOES), ethylene glycolbis(succinimidylsuccinate) 
(EGS), ethylene glycolbis(sulfosuccinimidylsuccinate) (sulfo-EGS), dithiobis(succinimidyl- 
propionate (DSP), and dithiobis(sulfosuccinimidylpropionate (sulfo-DSP). Preferred, non- 
limiting examples of homobifunctional imidoesters include dimethyl malonimidate (DMM), 
dimethyl succinimidate (DMSC), dimethyl adipimidate (DMA), dimethyl pimelimidate 
(DMP), dimethyl suberimidate (DMS), dimethyl-3,3'-oxydipropionimidate (DODP), 
dimethyl-3 ,3 '-(methy lenedioxy)dipropionimidate (DMDP), dimethyl-,3 '- 

(dimethylenedioxy)dipropionimidate(DDDP),dimethyl-3,3'-(tetramethylenedioxy)- 
dipropionimidate (DTDP), and dimethyl-3,3'-dithiobispropionimidate (DTBP). 

[0367] Preferred, non-limiting examples of homobifunctional isothiocyanates include: p- 

phenylenediisothiocyanate (DITC), and 4,4'-diisothiocyano-2,2'-disulfonic acid stilbene 
(DIDS). 

[0368] Preferred, non-limiting examples of homobifunctional isocyanates include xylene- 
diisocyanate, toluene-2,4-diisocyanate, toluene-2-isocyanate-4-isothiocyanate, 3- 

methoxydiphenylmetlaane-4,4'-diisocyanate, 2,2 , -dicarboxy-4,4 , -azophenyldiisocyanate, and 
hexamethylenediisocyanate. 

[0369] Preferred, non-limiting examples of homobifunctional arylhalides include 1,5- 
difluoro-2,4-dinitrobenzene (DFDNB), and 4,4'-difluoro-3,3 , -dinitrophenyl-sulfone. 
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[0370] Preferred, non-limiting examples of homobifunctional aliphatic aldehyde reagents 
include glyoxal, malondialdehyde, and glutaraldehyde. 

[0371] Preferred, non-limiting examples of homobifunctional acylating reagents include 
nitrophenyl esters of dicarboxylic acids. 

5 [0372] Preferred, non-limiting examples of homobifunctional aromatic sulfonyl chlorides 
include phenol-2,4-disulfonyl chloride, and a-naphthol-2,4-disulfonyl chloride. 

[0373] Preferred, non-limiting examples of additional amino-reactive homobifunctional 
reagents include erythritolbiscarbonate which reacts with amines to give biscarbamates. 

2. Homobifunctional Crosslinkers Reactive with Free Sulfhydryl Groups 

[0374] Synthesis, properties, and applications of such reagents are described in the 
literature (for reviews of crosslinking procedures and reagents, see above). Many of the 
reagents are commercially available (e.g., Pierce Chemical Company, Rockford, 111.; Sigma 
Chemical Company, St. Louis, Mo.; Molecular Probes, Inc., Eugene, OR). 

[0375] Preferred, non-limiting examples of homobifunctional maleimides include 
bismaleimidohexane (BMH), N,N'-(l,3-phenylene) bismaleimide, N,N'-(1,2- 
phenylene)bismaleimide, azophenyldimaleimide, and bis(N-maleimidomethyl)ether. 

[0376] Preferred, non-limiting examples of homobifunctional pyridyl disulfides include 
1 ,4-di-3'-(2'-pyridyldithio)propionamidobutane (DPDPB). 

« 

[0377] Preferred, non-limiting examples of homobifunctional alkyl halides include 2,2'- 
dicarboxy-4,4 f -diiodoacetamidoazobenzene, a,a f -diiodo-p-xylenesulfonic acid, a, a'-dibromo- 
p-xylenesulfonic acid, N,N'-bis(b-bromoethyl)benzylamine, N,N- 
di(bromoacetyl)phenylthydrazine, and 1 ,2-di(bromoacetyl)amino~3~phenylpropane. 

3. Homobifunctional Photoactivatable Crosslinkers 

[0378] Synthesis, properties, and applications of such reagents are described in the 
25 literature (for reviews of crosslinking procedures and reagents, see above). Some of the 

reagents are commercially available (e.g., Pierce Chemical Company, Rockford, 111.; Sigma 
Chemical Company, St. Louis, Mo.; Molecular Probes, Inc., Eugene, OR). 

[0379] Preferred, non-limiting examples of homobifunctional photoactivatable crosslinker 
include bis-P-(4-azidosalicylamido)ethyldisulfide (BASED), di-N-(2-nitro-4-azidophenyl)- 
30 cystamine-S,S-dioxide (DNCO), and 4,4-dithiobisphenylazide. 
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iv. HeteroBifunctional Reagents 

1. Amino-Reactive HeteroBifunctional Reagents with a Pyridyl Disulfide Moiety 

[0380] Synthesis, properties, and applications of such reagents are described in the 
literature (for reviews of crosslinking procedures and reagents, see above). Many of the 
5 reagents are commercially available (e.g., Pierce Chemical Company, Rockford, 111.; Sigma 
Chemical Company, St. Louis, Mo.; Molecular Probes, Inc., Eugene, OR). 

[0381] Preferred, non-limiting examples of hetero-bifunctional reagents with a pyridyl 
disulfide moiety and an amino-reactive NHS ester include N-succinimidyl-3-(2- 
pyridyldithio)propionate (SPDP), succinimidyl 6-3-(2-pyridyldithio)propionamidohexanoate 
10 (LC-SPDP), sulfosuccinimidyl 6-3-(2-pyridyldithio)propionamidohexanoate (sulfo- 

LCSPDP), 4-succinimidyloxycarbonyl-a-methyl-a-(2-pyridyldithio)toluene (SMPT), and 
sulfosuccinimidyl 6-a-methyl-a-(2-pyridyldithio)toluamidohexanoate (sulfo-LC-SMPT). 

2. Amino-Reactive HeteroBifunctional Reagents with a Maleimide Moiety 

[0382] Synthesis, properties, and applications of such reagents are described in the 
15 literature. Preferred, non-limiting examples of hetero-bifunctional reagents with a maleimide 
moiety and an amino-reactive NHS ester include succinimidyl maleimidylacetate (AMAS), 
succinimidyl 3-maleimidylpropionate (BMPS), N- y-maleimidobutyryloxy succinimide ester 
(GMBS)N-y-maleimidobutyryloxysulfo succinimide ester (sulfo-GMBS) succinimidyl 6- 
maleimidylhexanoate (EMCS), succinimidyl 3-maleimidylbenzoate (SMB), m- 
20 maleimidobenzoyl-N-hydroxy succinimide ester (MBS), m-maleimidobenzoyl-N- 
hydroxysulfosuccinimide ester (sulfo-MBS), succinimidyl 4-(N-maleimidomethyl)- 
cyclohexane- 1 -carboxylate (SMCC), sulfosuccinimidyl 4-(N-maleimidomethyl)cyclohexane- 
1 -carboxylate (sulfo-SMCC), succinimidyl 4-(p-maleimidophenyl)butyrate (SMPB), and 
sulfosuccinimidyl 4-(p-maleimidophenyl)butyrate (sulfo-SMPB). 

25 3. Amino-Reactive HeteroBifunctional Reagents -with an Alkyl Halide Moiety 

[0383] Synthesis, properties, and applications of such reagents are described in the 
literature Preferred, non-limiting examples of hetero-bifunctional reagents with an alkyl 
halide moiety and an amino-reactive NHS ester include N-succinimidyl-(4- 
iodoacetyl)aminobenzoate (SIAB), sulfosuccinimidyl-(4-iodoacetyl)aminobenzoate (sulfo- 
30 SIAB), succinimidyl-6-(iodoacetyl)aminohexanoate (SIAX), succinimidyl-6-(6-((iodoacetyl)- 
amino)hexanoylamino)hexanoate (SIAXX), succinimidyl-6-(((4-(iodoacetyl)-amino)- 
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methyl)-cyclohexane-l-carbonyl)aminohexanoate (SIACX), and succinimidyl-4((iodoacetyl)- 
amino)methylcyclohexane- 1 -carboxylate (SI AC) . 

[0384] An example of a hetero-bifunctional reagent with an ammo-reactive NHS ester and 
an alkyl dihalide moiety is N-hydroxysuccinimidyl 2 5 3-dibromopropionate (SDBP). SDBP 
5 introduces intramolecular crosslinks to the affinity component by conjugating its amino 
groups. The reactivity of the dibromopropionyl moiety towards primary amine groups is 
controlled by the reaction temperature (McKenzie et al, Protein Chem. 7: 581-592 (1988)). 

[0385] Preferred, non-limiting examples of hetero-bifunctional reagents with an alkyl 
halide moiety and an amino -reactive p-nitrophenyl ester moiety include p-nitrophenyl 
1 0 iodoacetate (NPI A) . 

[0386] Other cross-linking agents are known to those of skill in the art. See, for example, 
Pomato et al, U.S. Patent No. 5,965,106. It is within the abilities of one of skill in the art to 
choose an appropriate cross-linking agent for a particular application. 

v. Cleavable Linker Groups 

1 5 [0387] In yet a further embodiment, the linker group is provided with a group that can be 
cleaved to release the modifying group from the sugar residue. Many cleaveable groups are 
known in the art. See, for example, Jung et al, Biochem. Biophys. Acta 761: 152-162 (1983); 
Joshi et al,J. Biol Chem. 265: 14518-14525 (1990); Zarling et al, J. Immunol 124: 913-920 
(1980); Bouizar et al,Eur. J. Biochem. 155: 141-147 (1986); Park et al,J. Biol Chem. 261: 

20 205-210 (1986); Browning et al,J. Immunol 143: 1859-1867 (1989). Moreover a broad 
range of cleavable, bifunctional (both homo- and hetero-bifunctional) linker groups is 
commercially available from suppliers such as Pierce. 

[0388] Exemplary cleaveable moieties can be cleaved using light, heat or reagents such as 
thiols, hydroxylamine, bases, periodate and the like. Moreover, certain preferred groups are 
25 cleaved in vivo in response to being endocytized (e.g., cis-aconityl; see, Shen et al, Biochem. 
Biophys. Res. Commun. 102: 1048 (1991)). Preferred cleaveable groups comprise a 
cleaveable moiety which is a member selected from the group consisting of disulfide, ester, 
imide, carbonate, nitrobenzyl, phenacyl and benzoin groups. 

Conjugation of Modified Sugars to Peptides 

30 [0389] The modified sugars are conjugated to a glycosylated or non-glycosylated peptide 
using an appropriate enzyme to mediate the conjugation. Preferably, the concentrations of 
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the modified donor sugar(s), enzyme(s) and acceptor peptide(s) are selected such that 
glycosylation proceeds until the acceptor is consumed. The considerations discussed below, 
while set forth in the context of a sialyltransferase, are generally applicable to other 
glycosyltransferase reactions. 

5 [0390] A number of methods of using glycosyltransferases to synthesize desired 

oligosaccharide structures are known and are generally applicable to the instant invention. 
Exemplary methods are described, for instance, WO 96/32491, Ito et aL, Pure Appl Chem. 
65: 753 (1993), and U.S. Pat. Nos. 5,352,670, 5,374,541, and 5,545,553. 

[0391] The present invention is practiced using a single glycosyltransferase or a 
10 combination of glycosyltransferases. For example, one can use a combination of a 

sialyltransferase and a galactosyltransferase. In those embodiments using more than one 
enzyme, the enzymes and substrates are preferably combined in an initial reaction mixture, or 
the enzymes and reagents for a second enzymatic reaction are added to the reaction medium 
once the first enzymatic reaction is complete or nearly complete. By conducting two 
1 5 enzymatic reactions in sequence in a single vessel, overall yields are improved over 

procedures in which an intermediate species is isolated. Moreover, cleanup and disposal of 
extra solvents and by-products is reduced. 

[0392] In another embodiment, each of the first and second enzyme is a 
glycosyltransferase. In another embodiment, one enzyme is an endoglycosidase. In an 
20 additional embodiment, more than two enzymes are used to assemble the modified 

glycoprotein of the invention. The enzymes are used to alter a saccharide structure on the 
peptide at any point either before or after the addition of the modified sugar to the peptide. 

[0393] The O-linked glycosyl moieties of the conjugates of the invention are generally 
originate with a GalNAc moiety that is attached to the peptide. Any member of the family of 

25 GalNAc transferases can be used to bind a GalNAc moiety to the peptide (Hassan H, Bennett 
EP, Mandel U, Hollingsworth MA, and Clausen H (2000). Control of Mucin-Type O- 
Glycosylation: O-Glycan Occupancy is Directed by Substrate Specificities of Polypeptide 
GalNAc-Transferases. (Eds. Ernst, Hart, and Sinay). Wiley-VCH chapter "Carbohydrates in 
Chemistry and Biology - a Comprehension Handbook", 273-292). The GalNAc moiety itself 

30 can be the intact glycosyl linker. Alternatively, the saccharyl residue is built out using one 
more enzyme and one or more appropriate glycosyl substrate for the enzyme, the modified 
sugar being added to the built out glycosyl moiety. 
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[0394] In another embodiment, the method makes use of one or more exo- or 
endoglycosidase. The glycosidase is typically a mutant, which is engineered to form glycosyl 
bonds rather than cleave them. The mutant glycanase typically includes a substitution of an 
amino acid residue for an active site acidic amino acid residue. For example, when the 
5 endoglycanase is endo-H, the substituted active site residues will typically be Asp at position 
130, Glu at position 132 or a combination thereof. The amino acids are generally replaced 
with serine, alanine, asparagine, or glutamine. 

[0395] The mutant enzyme catalyzes the reaction, usually by a synthesis step that is 
analogous to the reverse reaction of the endoglycanase hydrolysis step. In these 

10 embodiments, the glycosyl donor molecule (e.g., a desired oligo- or mono-saccharide 

structure) contains a leaving group and the reaction proceeds with the addition of the donor 
molecule to a GlcNAc residue on the protein. For example, the leaving group can be a 
halogen, such as fluoride. In other embodiments, the leaving group is a Asn, or a Asn- 
peptide moiety. In yet further embodiments, the GlcNAc residue on the glycosyl donor 

1 5 molecule is modified. For example, the GlcNAc residue may comprise a 1 ,2 oxazoline 
moiety. 

[0396] In another embodiment, each of the enzymes utilized to produce a conjugate of the 
invention are present in a catalytic amount. The catalytic amount of a particular enzyme 
varies according to the concentration of that enzyme's substrate as well as to reaction 
20 conditions such as temperature, time and pH value. Means for determining the catalytic 
amount for a given enzyme under preselected substrate concentrations and reaction 
conditions are well known to those of skill in the art. 

[0397] The temperature at which an above process is carried out can range from just above 
freezing to the temperature at which the most sensitive enzyme denatures. Preferred 
25 temperature ranges are about 0 °C to about 55 °C, and more preferably about 20 0 C to about 
30 °C. In another exemplary embodiment, one or more components of the present method 
are conducted at an elevated temperature using a thermophilic enzyme. 

[0398] The reaction mixture is maintained for a period of time sufficient for the acceptor to 
be glycosylated, thereby forming the desired conjugate. Some of the conjugate can often be 
30 detected after a few hours, with recoverable amounts usually being obtained within 24 hours 
or less. Those of skill in the art understand that the rate of reaction is dependent on a number 
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of variable factors (e.g, enzyme concentration, donor concentration, acceptor concentration, 
temperature, solvent volume), which are optimized for a selected system. 

[0399] The present invention also provides for the industrial-scale production of modified 
peptides. As used herein, an industrial scale generally produces at least about 250 mg, 
5 preferably at least about 500 mg, and more preferably at least about 1 gram of finished, 
purified conjugate, preferably after a single reaction cycle, i.e., the conjugate is riot a 
combination the reaction products from identical, consecutively iterated synthesis cycles. 

[0400] In the discussion that follows, the invention is exemplified by the conjugation of 
modified sialic acid moieties to a glycosylated peptide. The exemplary modified sialic acid is 

10 labeled with (m~) PEG. The focus of the following discussion on the use of PEG-modified 
sialic acid and glycosylated peptides is for clarity of illustration and is not intended to imply 
that the invention is limited to the conjugation of these two partners. One of skill understands 
that the discussion is generally applicable to the additions of modified glycosyl moieties other 
than sialic acid. Moreover, the discussion is equally applicable to the modification of a 

15 glycosyl unit with agents other than PEG including other water-soluble polymers, therapeutic 
moieties, and biomolecules. 

[0401] An enzymatic approach can be used for the selective introduction of (m~) 
PEG-ylated or (m-) PPG-ylated carbohydrates onto a peptide or glycopeptide. The method 
utilizes modified sugars containing PEG, PPG, or a masked reactive functional group, and is 
20 combined with the appropriate glycosyltransferase or glycosynthase. By selecting the 

glycosyltransferase that will make the desired carbohydrate linkage and utilizing the modified 
sugar as the donor substrate, the PEG or PPG can be introduced directly onto the peptide 
backbone, onto existing sugar residues of a glycopeptide or onto sugar residues that have 
been added to a peptide. 

25 [0402] An acceptor for the sialyltransferase is present on the peptide to be modified by the 
methods of the present invention either as a naturally occurring structure or one placed there 
recombinantly, enzymatically or chemically. Suitable acceptors, include, for example, 
galactosyl acceptors such as GalNAc, Gaipi,4GlcNAc, Gaipi,4GalNAc, Galpl,3GalNAc, 
lacto-N-tetraose, Gaipi,3GlcNAc, Gaipi,3Ara, Gaipi,6GlcNAc, Gaipi,4Glc (lactose), and 

30 other acceptors known to those of skill in the art (see, e.g., Paulson et al, J. Biol Chem. 253: 
5617-5624 (1978)). 
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[0403] In one embodiment, an acceptor for the sialyltransferase is present on the 
glycopeptide to be modified upon in vivo synthesis of the glycopeptide. Such glycopeptides 
can be sialylated using the claimed methods without prior modification of the glycosylation 
pattern of the glycopeptide. Alternatively, the methods of the invention can be used to 
5 sialylate a peptide that does not include a suitable acceptor; one first modifies the peptide to 
include an acceptor by methods known to those of skill in the art. In an exemplary 
embodiment, a GalNAc residue is added to an O-linked glycosylation site by the action of a 
GalNAc transferase. Hassan H, Bennett EP ? Mandel U, Hollingsworth MA, and Clausen H 
(2000). Control of Mucin-Type O-Glycosylation: O^Glycan Occupancy is Directed by 
10 Substrate Specificities of Polypeptide GalNAc-Transferases. (Eds. Ernst, Hart, and Sinay). 
Wiley-VCH chapter "Carbohydrates in Chemistry and Biology - a Comprehension 
Handbook", 273-292. 

[0404] In an exemplary embodiment, the galactosyl acceptor is assembled by attaching a 
galactose residue to an appropriate acceptor linked to the peptide, e.g., a GalNAc. The 

1 5 method includes incubating the peptide to be modified with a reaction mixture that contains a 
suitable amount of a galactosyltransferase (e.g., Gaipi,3 or Galpl,4), and a suitable 
galactosyl donor (e.g., UDP-galactose). The reaction is allowed to proceed substantially to 
completion or, alternatively, the reaction is terminated when a preselected amount of the 
galactose residue is added. Other methods of assembling a selected saccharide acceptor will 

20 be apparent to those of skill in the art. 

[0405] In yet another embodiment, glycopeptide-linked oligosaccharides are first 
"trimmed," either in whole or in part, to expose either an acceptor for the sialyltransferase or 
a moiety to which one or more appropriate residues can be added to obtain a suitable 
acceptor. Enzymes such as glycosyltransferases and endoglycosidases (see, for example U.S. 
25 Patent No. 5,7 1 6,8 1 2) are useful for the attaching and trimming reactions. 

[0406] In the discussion that follows, the method of the invention is exemplified by the use 
of modified sugars having a water-soluble polymer attached thereto. The focus of the 
discussion is for clarity of illustration. Those of skill will appreciate that the discussion is 
equally relevant to those embodiments in which the modified sugar bears a therapeutic 
30 moiety, biomolecule or the like. 

[0407] In an exemplary embodiment, an O-linked carbohydrate residue is "trimmed" prior 
to the addition of the modified sugar. For example a GalNAc-Gal residue is trimmed back to 
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GalNAc. A modified sugar bearing a water-soluble polymer is conjugated to one or more of 
the sugar residues exposed by the "trimming." In one example, a glycopeptide is "trimmed" 
and a water-soluble polymer is added to the resulting O-side chain amino acid or 
glycopeptide glycan via a saccharyl moiety, e.g., Sia, Gal or GalNAc moiety conjugated to 
5 the water-soluble polymer. The modified saccharyl moiety is attached to an acceptor site on 
the "trimmed" glycopeptide. Alternatively, an unmodified saccharyl moiety, e.g., Gal can be 
added the terminus of the O-linked glycan. 

[0408] In another exemplary embodiment, a water-soluble polymer is added to a GalNAc 
residue via a modified sugar having a galactose residue. Alternatively, an unmodified Gal 
1 0 can be added to the terminal GalNAc residue. 

[0409] In yet a further example, a water-soluble polymer is added onto a Gal residue using 
a modified sialic acid. 

[0410] In another exemplary embodiment, an O-linked glycosyl residue is "trimmed back" 
to the GalNAc attached to the amino acid. In one example, a water-soluble polymer is added 
1 5 via a Gal modified with the polymer. Alternatively, an unmodified Gal is added to the 
GalNAc, followed by a Gal with an attached water-soluble polymer. In yet another 
embodiment, one or more unmodified Gal residue is added to the GalNAc, followed by a 
sialic acid moiety modified with a water-soluble polymer. 

[0411] The exemplary embodiments discussed above provide an illustration of the power of 
20 the methods set forth herein. Using the methods of the invention, it is possible to "trim back" 
and build up a carbohydrate residue of substantially any desired structure. The modified 
sugar can be added to the termini of the carbohydrate moiety as set forth above, or it can be 
intermediate between the peptide core and the terminus of the carbohydrate. 

[0412] In an exemplary embodiment, the water-soluble polymer is added to a terminal Gal 
25 residue using a polymer modified sialic acid. An appropriate sialyltransferase is used to add 
a modified sialic acid. The approach is summarized in Scheme 5. 
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Scheme 5 
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[0413] In yet a further approach, summarized in Scheme 6, a masked reactive functionality 
is present on the sialic acid. The masked reactive group is preferably unaffected by the 
conditions used to attach the modified sialic acid to the peptide. After the covalent 
attachment of the modified sialic acid to the peptide, the mask is removed and the peptide is 
conjugated with an agent such as PEG, PPG, a therapeutic moiety, biomolecule or other 
agent. The agent is conjugated to the peptide in a specific manner by its reaction with the 
unmasked reactive group on the modified sugar residue. 
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[0414] Any modified sugar can be used with its appropriate glycosyltransferase, depending 
on the terminal sugars of the oligosaccharide side chains of the glycopeptide (Table 3). As 
discussed above, the terminal sugar of the glycopeptide required for introduction of the 
PEG-ylated or PPGylated structure can be introduced naturally during expression or it can be 
produced post expression using the appropriate glycosidase(s), glycosyltransferase(s) or mix 
of glycosidase(s) and glycosyltransferase(s). 
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R, Rj- 4 = H, Linker-M, M. 

M - Ligand of interest 



Ligand of interest = acyl-PEG, acyl-PPG, alkyl-PEG, acyl-alkyl-PEG, 
acyl-alkyl-PEG, carbamoyl-PEG, carbamoyl-PPG, PEG, PPG, 
acyl-aryl-PEG, acyl-aryl-PPG, aryl-PEG, aryl-PPG, 
Mannose-g-phosphate, heparin, heparan, SLex, Mannose, FGF, VFGF, 
protein, chondroitin, keratan, dermatan, albumin, integrins, peptides, 
etc. 
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[0415] In an alternative embodiment, the modified sugar is added directly to the peptide 
backbone using a glycosyltransferase known to transfer sugar residues to the O-linked 
glycosylation site on the peptide backbone. This exemplary embodiment is set forth in 
Scheme 7. Exemplary glycosyltransferases useful in practicing the present invention include, 
but are not limited to, GalNAc transferases (GalNAc Tl-20), GlcNAc transferases, 
fucosyltransferases, glucosyltransferases, xylosyltransferases, mannosyltransferases and the 
like. Use of this approach allows the direct addition of modified sugars onto peptides that 

i 

lack any carbohydrates or, alternatively, onto existing glycopeptides. In both cases, the 
addition of the modified sugar occurs at specific positions on the peptide backbone as defined 
by the substrate specificity of the glycosyltransferase and not in a random manner as occurs 
during modification of a protein's peptide backbone using chemical methods. An array of 
agents can be introduced into proteins or glycopeptides that lack the glycosyltransferase 
substrate peptide sequence by engineering the appropriate amino acid sequence into the . 
polypeptide chain. 

Scheme 7 
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[0416] In each of the exemplary embodiments set forth above, one or more additional 
chemical or enzymatic modification steps can be utilized following the conjugation of the 
modified sugar to the peptide. In an exemplary embodiment, an enzyme (e.g., 
fucosyltransferase) is used to append a glycosyl unit (e.g., fucose) onto the terminal modified 
sugar attached to the peptide. In another example, an enzymatic reaction is utilized to "cap" 
(e.g., sialylate) sites to which the modified sugar failed to conjugate. Alternatively, a 
chemical reaction is utilized to alter the structure of the conjugated modified sugar. For 
example, the conjugated modified sugar is reacted with agents that stabilize or destabilize its 
linkage with the peptide component to which the modified sugar is attached. In another 
example, a component of the modified sugar is deprotected following its conjugation to the 
peptide. One of skill will appreciate that there is an array of enzymatic and chemical 
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procedures that are useful in the methods of the invention at a stage after the modified sugar 
is conjugated to the peptide. Further elaboration of the modified sugar-peptide conjugate is 
within the scope of the invention. 

[0417] In another exemplary embodiment, the glycopeptide is conjugated to a targeting 
5 agent, e.g. , transferrin (to deliver the peptide across the blood-brain barrier, and to 

endosomes), carnitine (to deliver the peptide to muscle cells; see, for example, LeBorgne et 
ah, Biochem. Pharmacol. 59: 1357-63 (2000), and phosphonates, e.g., bisphosphonate (to 
target the peptide to bone and other calciferous tissues; see, for example, Modern Drug 
Discovery, August 2002, page 10). Other agents useful for targeting are apparent to those of 
10 skill in the art. For example, glucose, glutamine and IGF are also useful to target muscle. 

[0418] The targeting moiety and therapeutic peptide are conjugated by any method 
discussed herein or otherwise known in the art. Those of skill will appreciate that peptides in 
addition to those set forth above can also be derivatized as set forth herein. Exemplary 
peptides are set forth in the Appendix attached to copending, commonly owned US 
15 Provisional Patent Application No. 60/328,523 filed October 10, 2001 . 

[0419] In an exemplary embodiment, the targeting agent and the therapeutic peptide are 
coupled via a linker moiety. In this embodiment, at least one of the therapeutic peptide or the 
targeting agent is coupled to the linker moiety via an intact glycosyl linking group according 
to a method of the invention. In an exemplary embodiment, the linker moiety includes a 
20 poly(ether) such as poly (ethylene glycol). In another exemplary embodiment, the linker 
moiety includes at least one bond that is degraded in vivo, releasing the therapeutic peptide 
from the targeting agent, following delivery of the conjugate to the targeted tissue or region 
of the body. 

[0420] In yet another exemplary embodiment, the in vivo distribution of the therapeutic 
25 moiety is altered via altering a glycoform on the therapeutic moiety without conjugating the 
therapeutic peptide to a targeting moiety. For example, the therapeutic peptide can be 
shunted away from uptake by the reticuloendothelial system by capping a terminal galactose 
moiety of a glycosyl group with sialic acid (or a derivative thereof). 

i. Enzymes 
30 1. Glycosyltransferases 

[0421] Glycosyltransferases catalyze the addition of activated sugars (donor NDP-sugars), 
in a step-wise fashion, to a protein, glycopeptide, lipid or glycolipid or to the non-reducing 
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end of a growing oligosaccharide. N-linked glycopeptides are synthesized via a transferase 
and a lipid-linked oligosaccharide donor Dol-PP-NAG 2 Glc 3 Man9 in an en block transfer 
followed by trimming of the core. In this case the nature of the "core" saccharide is 
somewhat different from subsequent attachments. A very large number of 
5 glycosyltransferases are known in the art. 

[0422] The glycosyltransferase to be used in the present invention may be any as long as it 
can utilize the modified sugar as a sugar donor. Examples of such enzymes include Leloir 
pathway glycosyltransferase, such as galactosyltransferase, N-acetylglucosaminyltransferase, 
N-acetylgalactosaminyltransferase, fucosyltransferase, sialyltransferase, mannosyltransferase, 
10 xylosyltransferase, glucurononyltransferase and the like. 

* 

i 
i 

^ 

[0423] For enzymatic saccharide syntheses that involve glycosyltransferase reactions, 
glycosyltransferase can be cloned, or isolated from any source. Many cloned 
glycosyltransferases are known, as are their polynucleotide sequences. See, e.g., "The WWW 
Guide To Cloned Glycosyltransferases," ( http : //www, vei . co .uk/TGN/gt guide .htm) . 
1 5 Glycosyltransferase amino acid sequences and nucleotide sequences encoding 

glycosyltransferases from which the amino acid sequences can be deduced are also found in 
various publicly available databases, including GenBank, Swiss-Prot, EMBL, and others. 

[0424] Glycosyltransferases that can be employed in the methods of the invention include, 
but are not limited to, galactosyltransferases, fucosyltransferases, glucosyltransferases, N- 
20 acetylgalactosaminyltransferases, N-acetylglucosaminyltransferases, glucuronyltransferases, 
sialyltransferases, mannosyltransferases, glucuronic acid transferases, galacturonic acid 
transferases, and oligosaccharyltransferases. Suitable glycosyltransferases include those 
obtained from eukaryotes, as well as from prokaryotes. 

[0425] DNA encoding glycosyltransferases may be obtained by chemical synthesis, by 
25 screening reverse transcripts of mRNA from appropriate cells or cell line cultures, by 

screening genomic libraries from appropriate cells, or by combinations of these procedures. 
Screening of mRNA or genomic DNA may be carried out with oligonucleotide probes 
generated from the glycosyltransferases gene sequence. Probes may be labeled with a 
detectable group such as a fluorescent group, a radioactive atom or a chemiluminescent group 
30 in accordance with known procedures and used in conventional hybridization assays. In the 
alternative, glycosyltransferases gene sequences may be obtained by use of the polymerase 
chain reaction (PCR) procedure, with the PCR oligonucleotide primers being produced from 
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the glycosyltransferases gene sequence. See, U.S. Pat. No. 4,683,195 to Mullis et ah and U.S. 
Pat. No. 4,683,202 to Mullis. 

[0426] The glycosyltransferase may be synthesized in host cells transformed with vectors 
containing DNA encoding the glycosyltransferases enzyme. Vectors are used either to 
5 amplify DNA encoding the glycosyltransferases enzyme and/or to express DNA which 
encodes the glycosyltransferases enzyme. An expression vector is a replicable DNA 
construct in which a DNA sequence encoding the glycosyltransferases enzyme is operably 
linked to suitable control sequences capable of effecting the expression of the 
glycosyltransferases enzyme in a suitable host. The need for such control sequences will 

1 0 vary depending upon the host selected and the transformation method chosen. Generally, 

control sequences include a transcriptional promoter, an optional operator sequence to control 
transcription, a sequence encoding suitable mRNA ribosomal binding sites, and sequences 
which control the termination of transcription and translation. Amplification vectors do not 
require expression control domains. All that is needed is the ability to replicate in a host, 

1 5 usually conferred by an origin of replication, and a selection gene to facilitate recognition of 
transformants. 

[0427] In an exemplary embodiment, the invention utilizes a prokaryotic enzyme. Such 
glycosyltransferases include enzymes involved in synthesis of lipooligosaccharides (LOS), 
which are produced by many gram negative bacteria (Preston et al, Critical Reviews in 

20 Microbiology 23(3): 139-1 80 (1996)). Such enzymes include, but are not limited to, the 
proteins of the rfa operons of species such as E. coli and Salmonella typhimurium, which 
include a pi,6 galactosyltransferase and a pi, 3 galactosyltransferase (see, e.g., EMBL 
Accession Nos. M80599 and M86935 (E. coli); EMBL Accession No. S56361 (S. 
typhimurium)), a glucosyltransferase (Swiss-Prot Accession No. P25740 (E. coli), an P 1,2- 

25 glucosyltransferase (r/&J)(Swiss-Prot Accession No. P27129 (E. coli) and Swiss-Prot 

Accession No. P19817 (S. typhimurium)), and an pi,2-N-acetylglucosaminyltransferase 
(r^fK)(EMBL Accession No. U00039 (E. coli). Other glycosyltransferases for which amino 
acid sequences are known include those that are encoded by operons such as rfaB, which 
have been characterized in organisms such as Klebsiella pneumoniae, E, coli, Salmonella 

30 typhimurium, Salmonella enterica, Yersinia enterocolitica, Mycobacterium leprosum, and the 
rhl operon of Pseudomonas aeruginosa. 
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[0428] Also suitable for use in the present invention are glycosyltransferases that are 
involved in producing structures containing lacto-N-neotetraose, D-galactosyl-p-l 5 4-N- 
acetyl-D-glucosaminyl-p-l 3 3-D-galactosyl-p-l ? 4-D-glucose ? and the P k blood group 
trisaccharide sequence, D-galactosyl-a-l 3 4-D-galactosyl-p-l,4-D-glucose, which have been 
5 identified in the LOS of the mucosal pathogens Neisseria gonnorhoeae and N. meningitidis 
(Scholten et al, J. Med Microbiol. 41: 236-243 (1994)). The genes from N. meningitidis and 
N gonorrhoeae that encode the glycosyltransferases involved in the biosynthesis of these 
structures have been identified from N. meningitidis immunotypes L3 and LI (Jennings et al, 
Mol Microbiol 18: 729-740 (1995)) and the K gonorrhoeae mutant F62 (Gotshlich, J. Exp. 

10 Med. 180: 2181-2190 (1994)). In N. meningitidis, a locus consisting of three genes, lgtA 9 

IgtB and Ig E, encodes the glycosyltransferase enzymes required for addition of the last three 
of the sugars in the lacto-iV-neotetraose chain (Wakarchuk et al, J. Biol. Chem. 271: 19166- 
73 (1996)). Recently the enzymatic activity of the IgtB and IgtA gene product was 
demonstrated, providing the first direct evidence for their proposed glycosyltransferase 

1 5 function (Wakarchuk et al, J. Biol. Chem. 271(45): 28271-276 (1 996)). In N gonorrhoeae, 
there are two additional genes, IgtD which adds p-D-GalNAc to the 3 position of the terminal 
galactose of the lacto-iV-neotetraose structure and IgtC which adds a terminal a-D-Gal to the 
lactose element of a truncated LOS, thus creating the P k blood group antigen structure 
(Gotshlich (1994), supra.). In K meningitidis, a separate immunotype LI also expresses the 

20 P k blood group antigen and has been shown to carry an IgtC gene (Jennings et al, (1995), 
supra.). Neisseria glycosyltransferases and associated genes are also described in USPN 
5,545,553 (Gotschlich). Genes for al,2~fucosyltransferase and al,3-fucosyltransferase from 
Helicobacter pylori has also been characterized (Martin et al, J. Biol. Chem. 272: 21349- 
21356 (1997)). Also of use in the present invention are the glycosyltransferases of 

25 Campylobacter jejuni (see, for example, http://afinb.ciirs-mrs.fr/-pedro/CAZY/gtf_42.html). 

a) Fucosyl transferases 

[0429] In some embodiments, a glycosyltransferase used in the method of the invention is a 
fucosyltransferase. Fucosyltransferases are known to those of skill in the art. Exemplary 
fucosyltransferases include enzymes, which transfer L-fucose from GDP-fucose to a hydroxy 
30 position of an acceptor sugar. Fucosyltransferases that transfer non-nucleotide sugars to an 
acceptor are also of use in the present invention. 
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[0430] In some embodiments, the acceptor sugar is, for example, the GlcNAc in a 
Galp(l->3,4)GlcNAcp- group in an oligosaccharide glycoside. Suitable fucosyltransferases 
for this reaction include the Gal(3(l->3 ? 4)GlcNAcpi-a(l-^3,4)fucosyltransferase (FTIII E.G. 
No. 2.4.1.65), which was first characterized from human milk (see, Palcic, et al, 
5 Carbohydrate Res. 190: 1-11 (1989); Prieels, et al, J. Biol. Chem. 256: 10456-10463 (1981); 
and Nunez, et al, Can. J. Chem. 59: 2086-2095 (1981)) and the Gaip(l->4)GlcNAcP- 
afucosyltransferases (FTIV, FTV, FTVI) which are found in human serum. FTVII (E.G. No. 
2.4.1.65), a sialyl a(2->3)Gaip((l->3)GlcNAcP fucosyltransferase, has also been 
characterized. A recombinant form of the Gaip(1^3,4) GlcNAcP- 

10 a(l-^3,4)fucosyltransferase has also been characterized (see, Dumas, et al, Bioorg. Med. 
Letters 1: 425-428 (1991) and Kukowska-Latallo, et al, Genes and Development 4: 1288- 
1303 (1990)). Other exemplary fucosyltransferases include, for example, ocl,2 
fucosyltransferase (E.G. No. 2.4.1.69). Enzymatic fucosylation can be carried out by the 
methods described in Mollicone, et al, Eur. J. Biochem. 191: 169-176 (1990) or U.S. Patent 

15 No. 5,374,655. Cells that are used to produce a fucosyltransferase will also include an 
enzymatic system for synthesizing GDP-fucose. 

b) Galactosyltransferases 

« 

[0431] In another group of embodiments, the glycosyltransferase is a galactosyltransferase. 
Exemplary galactosyltransferases include oc(l,3) galactosyltransferases (E.G. No. 2.4.1.151, 

20 see, e.g., Dabkowski et al, Transplant Proc. 25:2921 (1993) and Yamamoto et al Nature 
345: 229-233 (1990), bovine (GenBankj 04989, Joziasse et al, J. Biol Chem. 264: 14290- 
14297 (1989)), murine (GenBank m26925; Larsen et al, Proc. Natl. Acad. Sci. USA 86: 
8227-8231 (1989)), porcine (GenBank L3 6 152; Sirahanetal, Immunogene tics 41: 101-105 
(1995)). Another suitable al,3 galactosyltransferase is that which is involved in synthesis of 

25 the blood group B antigen (EC 2.4.1.37, Yamamoto et al, J. Biol Chem. 265: 1 146-1 151 
(1990) (human)). Yet a further exemplary galactosyltransferase is core Gal-TL 

[0432] Also suitable for use in the methods of the invention are p(l,4) 
galactosyltransferases, which include, for example, EC 2.4.1.90 (LacNAc synthetase) and EC 
2.4.1.22 (lactose synthetase) (bovine (D'Agostaro et al, Eur. J. Biochem. 183: 211-217 
30 (1989)), human (Masri et.al, Biochem. Biophys. Res. Commun. 157: 657-663 (1988)), murine 
(Nakazawa et al, J. Biochem. 104: 165-168 (1988)), as well as E.G. 2.4.1.38 and the 
ceramide galactosyltransferase (EC 2.4.1.45, Stahl et al, J. Neurosci. Res. 38: 234-242 
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(1994)). Other suitable galactosyltraasferases include, for example, ocl,2 
galactosyltransferases (from e.g., Schizosaccharomyces pombe, Chapell et al, Mol Biol. Cell 
5: 519-528(1994)). 

[0433] Also suitable in the practice of the invention are r soluble forms of al, 3- 
5 galactosyltransferase such as that reported by Cho,S.K. andCummings,R.D. (1997) J. Biol. 
Chem., 272, 13622-13628. 

c) Sialyltransferases 

[0434] Sialyltransferases are another type of glycosyltransferase that is useful in the 
recombinant cells and reaction mixtures of the invention. Cells that produce recombinant 

10 sialyltransferases will also produce CMP-sialic acid, which is a sialic acid donor for 

sialyltransferases. Examples of sialyltransferases that are suitable for use in the present 
invention include ST3Gal III {e.g., a rat or human ST3Gal III), ST3Gal IV, ST3Gal I, ST6Gal 
I, ST3Gal V, ST6Gal II, ST6GalNAc I, ST6GalNAc II, and ST6GalNAc III (the 
sialyltransferase nomenclature used herein is as described in Tsuji et al, Glycobiology 6: v- 

15 xiv (1996)). An exemplary oc(2,3)sialyltransferase referred to as oc(2,3)sialyltransferase (EC 
2.4.99.6) transfers sialic acid to the non-reducing terminal Gal of a Gal P 1-^3 Glc disaccharide 
or glycoside. See, Van denEijnden et al., J. Biol. Chem. 256: 3159 (1981), Weinstein et al, 
J. Biol Chem. 257: 13845 (1982) and Wen et al, J. Biol Chem. 267: 21011 (1992). Another 
exemplary oc2,3-sialyltransferase (EC 2.4.99.4) transfers sialic acid to the non-reducing 

20 terminal Gal of the disaccharide or glycoside, see, Rearick et al, J. Biol Chem. 254: 4444 
(1979) and Gillespie et al, J. Biol Chem. 267: 21004 (1992). Further exemplary enzymes 
include Gal- p - 1 ,4-GlcN Ac a-2,6 sialyltransferase (See, Kurosawa et al Eur. J. Biochem. 
219: 375-381 (1994)). 

[0435] Preferably, for glycosylation of carbohydrates of glycopeptides the sialyltransferase 
25 will be able to transfer sialic acid to the sequence Gaip 1 ,4GlcNAc-, the most common 
penultimate sequence underlying the terminal sialic acid on fully sialylated carbohydrate 
structures (see, Table 5) . 
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Table 5: Sialyltransf erases which use the Gaipi ? 4GlcNAc sequence as an acceptor 
substrate 



Sialyltransferase 


Source 


Sequence(s) formed 


Ref. 




ivi ammanan 


JNeuAca2 5 6Galp 1 ,4GlcN Ac- 


i 


CTQPnl TTT 
O 1 3 Gal 111 


Mammalian 


NeuAcoc2 , 3Galp 1 ,4GlcN Ac- 
NeuAca2 , 3Gaip 1 , 3GlcN Ac- 


1 


o 1 jual IV 


ivi ammanan 


JN euAcaz , JCralp 1 ,4LrlCJN Ac- 
NeuAca2 , 3Galp 1 , 3GlcNAc- 


i 


ST6Gal II 


Mammalian 


NeuAca2 ,6Galp 1 ,4GlcNAc 




ST6Gal II 


photobacterium 


NeuAca2 , 6Gaip 1 ,4GlcNAc- 


2 


ST3Gal V 


iV. meningitides 
N. gonorrhoeae 


NeuAca2, 3Gaip 1 ,4GlcNAc- 


3 



1) Goochee al, Bio/Technology 9: 1347-1355 (1991) 



2) Yamamoto etf a/., Biochem. 120: 104-1 10 (1996) 
5 3) Gilbert et al, J. Biol Chem. 271: 28271-28276 (1996) 

[0436] An example of a sialyltransferase that is useful in the claimed methods is ST3Gal 
III, which is also referred to as a(2,3)sialyltransferase (EC 2.4.99.6). This enzyme catalyzes 
the transfer of sialic acid to the Gal of a Gaipi,3GlcNAc or Galpl,4GlcNAe glycoside (see, 
e.g., Wen et al, J. Biol. Chem. 267: 2101 1 (1992); Van den Eijnden et al, J. Biol Chem. 

10 256: 3159 (1991)) and is responsible for sialylation of asparagine-linked oligosaccharides in 
glycopeptides. The sialic acid is linked to a Gal with the formation of an a-linkage between 
the two saccharides. Bonding (linkage) between the saccharides is between the 2-position of 
NeuAc and the 3 -position of Gal. This particular enzyme can be isolated from rat liver 
(Weinstein et al, J. Biol Chem. 257: 13845 (1982)); the human cDNA (Sasaki et al (1993) 

15 J. Biol Chem. 268: 22782-22787; Kitagawa & Paulson (1994) J. Biol Chem. 269: 1394- 

1401) and genomic (Kitagawa et al (1996) J. Biol Chem. 271: 931-938) DNA sequences are 
known, facilitating production of this enzyme by recombinant expression. In another 
embodiment, the claimed sialylation methods use a rat ST3Gal III. 

[0437] Other exemplary sialyltransferases of use in the present invention include those 
20 isolated from Campylobacter jejuni, including the a(2,3). See, e.g, WO99/49051. 

[0438] Sialyltransferases other those listed in Table 5, are also useful in an economic and 
efficient large-scale process for sialylation of commercially important glycopeptides. As a 
simple test to find out the utility of these other enzymes, various amounts of each enzyme 
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(1-100 mU/mg protein) are reacted with asialo-ai AGP (at 1-10 mg/ml) to compare the 
ability of the sialyltransferase of interest to sialylate glycopeptides relative to either bovine 
ST6Gal I, ST3Gal III or both sialyltransferases. Alternatively, other glycopeptides or 
glycopeptides, or N-linked oligosaccharides enzymatically released from the peptide 
5 backbone can be used in place of asialo-ai AGP for this evaluation. Sialyltransferases with 
the ability to sialylate N-linked oligosaccharides of glycopeptides more efficiently than 
ST6Gal I are useful in a practical large-scale process for peptide sialylation (as illustrated for 
ST3Gal III in this disclosure). Other exemplary sialyltransferases are shown in Figure 10. 

d) GalNAc transferases 

1 0 [0439] N-acetylgalactosaminyltransferases are of use in practicing the present invention, 

particularly for binding a GalNAc moiety to an amino acid of the O-linked glycosylation site 
of the peptide. Suitable N-acetylgalactosaminyltransferases include, but are not limited to, 
a(l,3) N-acetylgalactosaminyltransferase, P(l ? 4) N-acetylgalactosaminyltransferases (Nagata 
et al, J. Biol Chem. 267: 12082-12089 (1992) and Smith et al, J. Biol Chem. 269: 15162 

15 (1994)) and polypeptide N-acetylgalactosaminyltransferase (Homa et al, J. Biol. Chem. 268: 
12609 (1993)). 

[0440] Production of proteins such as the enzyme GalNAc Ti-xx from cloned genes by 
genetic engineering is well known. See, eg., U.S. Pat. No. 4,761,371. One method involves 
collection of sufficient samples, then the amino acid sequence of the enzyme is determined 

20 by N-terminal sequencing. This information is then used to isolate a cDNA clone encoding a 
full-length (membrane bound) transferase which upon expression in the insect cell line Sf9 
resulted in the synthesis of a fully active enzyme. The acceptor specificity of the enzyme is 
then determined using a semiquantitative analysis of the amino acids surrounding known 
glycosylation sites in 1 6 different proteins followed by in vitro glycosylation studies of 

25 synthetic peptides. This work has demonstrated that certain amino acid residues are 

overrepresented in glycosylated peptide segments and that residues in specific positions 
surrounding glycosylated serine and threonine residues may have a more marked influence on 
acceptor efficiency than other amino acid moieties. 

2. Sulfotransferases 

30 [0441] The invention also provides methods for producing peptides that include sulfated 
molecules, including, for example sulfated polysaccharides such as heparin, heparan sulfate, 
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carragenen, and related compounds. Suitable sulfotransferases include, for example, 
chondroitin-6-sulphotransferase (chicken cDNA described by Fukuta et al. s J. Biol Chem. 
270: 18575-18580 (1995); GenBank Accession No. D49915), glycosaminoglycan N- 
acetylglucosamineN-deacetylase/N-sulphotransferase 1 (Dixon et ah, Genomics 26: 239-241 
5 (1995); UL 18918), and glycosaminoglycan N-acetylglucosamine N-deacetylase/N- 

sulphotransferase 2 (murine cDNA described in Orellana et ah, J. Biol Chem. 269: 2270- 
2276 (1994) and Eriksson et al, J. Biol Chem. 269: 10438-10443 (1994); human cDNA 
described in GenBank Accession No. U2304). 

3. Cell-Bound Glycosyltransferases 

10 [0442] In another embodiment, the enzymes utilized in the method of the invention are 
cell-bound glycosyltransferases. Although many soluble glycosyltransferases are known 
(see, for example, U.S. Pat. No. 5,032,519), glycosyltransferases are generally in membrane- 
bound form when associated with cells. Many of the membrane-bound enzymes studied thus 
far are considered to be intrinsic proteins; that is, they are not released from the membranes 

15 by sonication and require detergents for solubilization. Surface glycosyltransferases have 
been identified on the surfaces of vertebrate and invertebrate cells, and it has also been 
recognized that these surface transferases maintain catalytic activity under physiological 
conditions. However, the more recognized function of cell surface glycosyltransferases is for 
intercellular recognition (Roth, MOLECULAR Approaches to Supracellular Phenomena, 

20 1990). 

[0443] Methods have been developed to alter the glycosyltransferases expressed by cells. 
For example, Larsen et al, Proc. Natl. Acad. Sci. USA 86: 8227-8231 (1989), report a genetic 
approach to isolate cloned cDNA sequences that determine expression of cell surface 
oligosaccharide structures and their cognate glycosyltransferases. A cDNA library generated 
25 from mRNA isolated from a murine cell line known to express UDP-galactose:.(3.-D- 

galactosyl-l,4-N-acetyl-D-glucosaminide a-l,3-galactosyltransferase was transfected into 
COS-1 cells. The transfected cells were then cultured and assayed for a 1-3 
galactosyltransferase activity. 

[0444] Francisco et al 3 Proc. Natl Acad. Sci. USA 89: 2713-2717 (1992), disclose a 
30 method of anchoring p-lactamase to the external surface of Escherichia coli. A tripartite 
fusion consisting of (i) a signal sequence of an outer membrane protein, (ii) a membrane- 
spanning section of an outer membrane protein, and (iii) a complete mature p-lactamase 

■ 

118 



WO 2005/070138 PCT/US2005/000799 

sequence is produced resulting in an active surface bound p-lactamase molecule. However, 
the Francisco method is limited only to procaryotic cell systems and as recognized by the 
authors, requires the complete tripartite fusion for proper functioning. 

4. Fusion Proteins 

5 [0445] In other exemplary embodiments, the methods of the invention utilize fusion 

proteins that have more than one enzymatic activity that is involved in synthesis of a desired 
glycopeptide conjugate. The fusion polypeptides can be composed of, for example, a 
catalytically active domain of a glycosyltransferase that is joined to a catalytically active 
domain of an accessory enzyme. The accessory enzyme catalytic domain can, for example, 

1 0 catalyze a step in the formation of a nucleotide sugar that is a donor for the 

glycosyltransferase, or catalyze a reaction involved in a glycosyltransferase cycle. For 
example, a polynucleotide that encodes a glycosyltransferase can be joined, in-frame, to a 
polynucleotide that encodes an enzyme involved in nucleotide sugar synthesis. The resulting 
fusion protein can then catalyze not only the synthesis of the nucleotide sugar, but also the 

1 5 transfer of the sugar moiety to the acceptor molecule. The fusion protein can be two or more 
cycle enzymes linked into one expressible nucleotide sequence. In other embodiments the 
fusion protein includes the catalytically active ddmains of two or more glycosyltransferases. 
See, for example, 5,641,668. The modified glycopeptides of the present invention can be 
readily designed and manufactured utilizing various suitable fusion proteins {see, for 

20 example, PCT Patent Application PCT/CA98/01 1 80, which was published as WO 99/3 1224 
on June 24, 1999.) 

5. Immobilized Enzymes 

[0446] In addition to cell-bound enzymes, the present invention also provides for the use of 
enzymes that are immobilized on a solid and/or soluble support. In an exemplary 

25 embodiment, there is provided a glycosyltransferase that is conjugated to a PEG via an intact 
glycosyl linker according to the methods of the invention. The PEG-linker-enzyme conjugate 
is optionally attached to solid support. The use of solid supported enzymes in the methods of 
the invention simplifies the work up of the reaction mixture and purification of the reaction 
product, and also enables the facile recovery of the enzyme. The glycosyltransferase 

30 conjugate is utilized in the methods of the invention. Other combinations of enzymes and 
supports will be apparent to those of skill in the art. 
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Purification of Peptide Conjugates 

[0447] The products produced by the above processes can be used without purification. 
However, it is usually preferred to recover the product. Standard, well-known techniques for 
recovery of glycosylated saccharides such as thin or thick layer chromatography, column 
5 chromatography, ion exchange chromatography, or membrane filtration can be used. It is 
preferred to use membrane filtration, more preferably utilizing a reverse osmotic membrane, 
or one or more column chromatographic techniques for the recovery as is discussed 
hereinafter and in the literature cited herein. For instance, membrane filtration wherein the 
membranes have molecular weight cutoff of about 3000 to about 10,000 can be used to 

10 remove proteins such as glycosyl transferases. Nanofiltration or reverse osmosis can then be 
used to remove salts and/or purify the product saccharides (see, e.g., WO 98/15581). 
Nanofilter membranes are a class of reverse osmosis membranes that pass monovalent salts 
but retain polyvalent salts and uncharged solutes larger than about 100 to about 2,000 
Daltons, depending upon the membrane used. Thus, in a typical application, saccharides 

15 prepared by the methods of the present invention will be retained in the membrane and 
contaminating salts will pass through. 

[0448] If the modified glycoprotein is produced intracellularly, as a first step, the 
particulate debris, either host cells or lysed fragments, is removed, for example, by 
centrifugation or ultrafiltration; optionally, the protein may be concentrated with a 

20 commercially available protein concentration filter, followed by separating the polypeptide 
variant from other impurities by one or more steps selected from immunoaffmity 
chromatography, ion-exchange column fractionation (e.g., on diethylaminoethyl (DEAE) or 
matrices containing carboxymethyl or sulfopropyl groups), chromatography on Blue- 
Sepharose, CM Blue-Sepharose, MONO-Q, MONO-S, lentil lectin-Sepharose, WGA- 

25 Sepharose, Con A-Sepharose, Ether Toyopearl, Butyl Toyopearl, Phenyl Toyopearl, SP- 
Sepharose, or protein A Sepharose, SDS-PAGE chromatography, silica chromatography, 
chromatofocusing, reverse phase HPLC (e.g., silica gel with appended aliphatic groups), gel 
filtration using, e.g., Sephadex molecular sieve or size-exclusion chromatography, 
chromatography on columns that selectively bind the polypeptide, and ethanol or ammonium 

3 0 sulfate precipitation. 

[0449] Modified glycopeptides produced in culture are usually isolated by initial extraction 
from cells, enzymes, etc., followed by one or more concentration, salting-out, aqueous ion- 
exchange, or size-exclusion chromatography steps, e.g., SP Sepharose. Additionally, the 
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modified glycoprotein may be purified by affinity chromatography. HPLC may also be 
employed for one or more purification steps. 

[0450] A protease inhibitor, e.g., methylsulfonylfluoride (PMSF) may be included in any of 
the foregoing steps to inhibit proteolysis and antibiotics may be included to prevent the 
5 growth of adventitious contaminants. 

[0451] Within another embodiment, supernatants from systems which sproduce the 
modified glycopeptide of the invention are first concentrated using a commercially available 
protein concentration filter, for example, an Amicon or Millipore Pellicon ultrafiltration unit. 
Following the concentration step, the concentrate may be applied to a suitable purification 

1 0 matrix. For example, a suitable affinity matrix may comprise a ligand for the peptide, a lectin 
or antibody molecule bound to a suitable support. Alternatively, an anion-exchange resin 
may be employed, for example, a matrix or substrate having pendant DEAE groups. Suitable 
matrices include acrylamide, agarose, dextran, cellulose, or other types commonly employed 
in protein purification. Alternatively, a cation-exchange step may be employed. Suitable 

1 5 cation exchangers include various insoluble matrices comprising sulfopropyl or 
carboxymethyl groups. Sulfopropyl groups are particularly preferred. 

[0452] Finally, one or more RP-HPLC steps employing hydrophobic RP-HPLC media, e.g. , 
silica gel having pendant methyl or other aliphatic groups, may be employed to further purify 
a polypeptide variant composition. Some or all of the foregoing purification steps, in various 
20 combinations, can also be employed to provide a homogeneous modified glycoprotein. 

[0453] The modified glycopeptide of the invention resulting from a large-scale 
fermentation may be purified by methods analogous to those disclosed by Urdal et al. } J. 
Chromatog. 296: 171 (1984). This reference describes two sequential, RP-HPLC steps for 
purification of recombinant human IL-2 on a preparative HPLC column. Alternatively, 
25 techniques such as affinity chromatography may be utilized to purify the modified 
glycoprotein. 

Pharmaceutical Compositions 

[0454] Polypeptides modified at various O-linked glycosylation site according to the 
method of the present invention have a broad range of pharmaceutical applications. For 
30 example, modified erythropoietin (EPO) may be used for treating general anemia, aplastic 
anemia, chemo-induced injury (such as injury to bone marrow), chronic renal failure, 
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nephritis, and thalassemia. Modified EPO may be further used for treating neurological 
disorders such as brain/spine injury, multiple sclerosis, and Alzheimer's disease. 

[0455] A second example is interferon-a (IFN-a), which may be used for treating AIDS 
and hepatitis B or C, viral infections caused by a variety of viruses such as human papilloma 
5 virus (HBV), coronavirus, human immunodeficiency virus (HIV), herpes simplex virus 

(HSV), and varicella-zoster virus (VZV), cancers such as hairy cell leukemia, AIDS-related 
Kaposi's sarcoma, malignant melanoma, follicular non-Hodgkins lymphoma, Philladephia 
chromosome (Ph)-positive, chronic phase myelogenous leukemia (CML), renal cancer, 
myeloma, chronic myelogenous leukemia, cancers of the head and neck, bone cancers, as 

1 0 well as cervical dysplasia and disorders of the central nervous system (CNS) such as multiple 
sclerosis. In addition, IFN-a modified according to the methods of the present invention is 
useful for treating an assortment of other diseases and conditions such as Sjogren's symdrome 
(an autoimmune disease), Behcet's disease (an autoimmune inflammatory disease), 
fibromyalgia (a musculoskeletal pain/fatigue disorder), aphthous ulcer (canker sores), chronic 

1 5 fatigue syndrome, and pulmonary fibrosis. 

[0456] Another example is interferon-p, which is useful for treating CNS disorders such as 
multiple sclerosis (either relapsing/remitting or chronic progressive), AIDS and hepatitis B or 
C, viral infections caused by a variety of viruses such as human papilloma virus (HBV), 
human immunodeficiency virus (HIV), herpes simplex virus (HSV), and varicella-zoster 

20 virus (VZV), otological infections, musculoskeletal infections, as well as cancers including 
breast cancer, brain cancer, colorectal cancer, non-small cell lung cancer, head and neck 
cancer, basal cell cancer, cervical dysplasia, melanoma, skin cancer, and liver cancer. IFN-(3 
modified according to the methods of the present invention is also used in treating other 
diseases and conditions such as transplant rejection (e.g., bone marrow transplant), 

25 Huntington's chorea, colitis, brain inflammation, pulmonary fibrosis, macular degeneration, 
hepatic cirrhosis, and keratoconjunctivitis. 

[0457] Granulocyte colony stimulating factor (G-CSF) is a further example. G-CSF 
modified according to the methods of the present invention may be used as an adjunct in 
chemotherapy for treating cancers, and to prevent or alleviate conditions or complications 
30 associated with certain medical procedures, e.g. , chemo-induced bone marrow injury; 

leucopenia (general); chemo-induced febrile neutropenia; neutropenia associated with bone 
marrow transplants; and severe, chronic neutropenia. Modified G-CSF may also be used for 
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transplantation; peripheral blood cell mobilization; mobilization of peripheral blood 
progenitor cells for collection in patients who will receive myeloablative or myelosuppressive 
chemotherapy; and reduction in duration of neutropenia, fever, antibiotic use, hospitalization 
following induction/consolidation treatment for acute myeloid leukemia (AML). Other 
5 condictions or disorders may be treated with modified G-CSF include asthma and allergic 
rhinitis. 

[0458] As one additional example, human growth hormone (hGH) modified according to 
the methods of the present invention may be used to treat growth-related conditions such as 
dwarfism, short-stature in children and adults, cachexia/muscle wasting, general muscular 

10 atrophy, and sex chromosome abnormality (e.g., Turner's Syndrome). Other conditions may 
be treated using modified hGH include: short-bowel syndrome, lipodystrophy, osteoporosis, 
uraemaia, burns, female infertility, bone regeneration, general diabetes, type II diabetes, 
osteo-arthritis, chronic obstructive pulmonary disease (COPD), and insomia. Moreover, 
modified hGH may also be used to promote various processes, e.g., general tissue 

15 regeneration, bone regeneration, and wound healing, or as a vaccine adjunct. 

[0459] Thus, in another aspect, the invention provides a pharmaceutical composition. The 
pharmaceutical composition includes a pharmaceutically acceptable diluent and a covalent 

i 

conjugate between a non-naturally-occurring, water-soluble polymer, therapeutic moiety or 
biomolecule and a glycosylated or non-glycosylated peptide. The polymer, therapeutic 
20 moiety or biomolecule is conjugated to the peptide via an intact glycosyl linking group 

interposed between and covalently linked to both the peptide and the polymer, therapeutic 
moiety or biomolecule. 4 

[0460] Pharmaceutical compositions of the invention are suitable for use in a variety of 
drug delivery systems. Suitable formulations for use in the present invention are found in 
25 Remington's Pharmaceutical Sciences, Mace Publishing Company, Philadelphia, PA, 17th 
ed. (1985). For a brief review of methods for drug delivery, see, Langer, Science 249:1527- 
1533 (1990). 

[0461] The pharmaceutical compositions may be formulated for any appropriate manner of 
administration, including for example, topical, oral, nasal, intravenous, intracranial, 
30 intraperitoneal, subcutaneous or intramuscular administration. For parenteral administration, 
such as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a 
wax or a buffer. For oral administration, any of the above carriers or a solid carrier, such as 
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mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, 
sucrose, and magnesium carbonate, may be employed. Biodegradable matrises, such as 
microspheres (e.g., polylactate polyglycolate), may also be employed as carriers for the 
pharmaceutical compositions of this invention. Suitable biodegradable microspheres are 
5 disclosed, for example, in U.S. Patent Nos. 4,897,268 and 5,075,109. 

[0462] Commonly, the pharmaceutical compositions are administered subcutaneously or 
parenterally, e.g., intravenously. Thus, the invention provides compositions for parenteral 
administration which comprise the compound dissolved or suspended in an acceptable 
carrier, preferably an aqueous carrier, e.g., water, buffered water, saline, PBS and the like. 
10 The compositions may also contain, detergents such as Tween 20 and Tween 80; stablizers 
such as mannitol, sorbitol, sucrose, and trehalose; and preservatives such as EDTA and m- 
cresol. The compositions may contain pharmaceutically acceptable auxiliary substances as 
required to approximate physiological conditions, such as pH adjusting and buffering agents, 
tonicity adjusting agents, wetting agents, detergents and the like. 

15 [0463] These compositions may be sterilized by conventional sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or 
lyophilized, the lyophilized preparation being combined with a sterile aqueous carrier prior to 
administration. The pH of the preparations typically will be between 3 and 1 1 , more 
preferably from 5 to 9 and most preferably from 7 and 8. 

20 [0464] In some embodiments the glycopeptides of the invention can be incorporated into 
liposomes formed from standard vesicle-forming lipids. A variety of methods are available 
for preparing liposomes, as described in, e.g., Szoka et al,Ann. Rev. Biophys. Bioeng. 9: 467 
(1980), U.S. Pat. Nos. 4,235,871, 4,501,728 and 4,837,028. The targeting of liposomes using 
a variety of targeting agents (e.g., the sialyl galactosides of the invention) is well known in 

25 the art (see, e.g., U.S. Patent Nos. 4,957,773 and 4,603,044). 

[0465] Standard methods for coupling targeting agents to liposomes can be used. These 
methods generally involve incorporation into liposomes of lipid components, such as 
phosphatidylethanolamine, which can be activated for attachment of targeting agents, or 
derivatized lipophilic compounds, such as lipid-derivatized glycopeptides of the invention. 

30 [0466] Targeting mechanisms generally require that the targeting agents be positioned on 
the surface of the liposome in such a manner that the target moieties are available for 
interaction with the target, for example, a cell surface receptor. The carbohydrates of the 
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invention may be attached to a lipid molecule before the liposome is formed using methods 
known to those of skill in the art (e.g., alkylation or acylation of a hydroxyl group present on 
the carbohydrate with a long chain alkyl halide or with a fatty acid, respectively). 
Alternatively, the liposome may be fashioned in such a way that a connector portion is first 
5 incorporated into the membrane at the time of forming the membrane. The connector portion 
must have a lipophilic portion, which is firmly embedded and anchored in the membrane. It 
must also have a reactive portion, which is chemically available on the aqueous surface of the 
liposome. The reactive portion is selected so that it will be chemically suitable to form a 
stable chemical bond with the targeting agent or carbohydrate, which is added later. In some 
10 cases it is possible to attach the target agent to the connector molecule directly, but in most 
instances it is more suitable to use a third molecule to act as a chemical bridge, thus linking 
the connector molecule which is in the membrane with the target agent or carbohydrate which 
is extended, three dimensionally, off of the vesicle surface. 

[0467] The compounds prepared by the methods of the invention may also find use as 
15 diagnostic reagents. For example, labeled compounds can be used to locate areas of 

inflammation or tumor metastasis in a patient suspected of having an inflammation. For this 
use, the compounds can be labeled with 125 I, 14 C, or tritium. 

[0468] The following examples are provided to illustrate the conjugates, and methods and 
of the present invention, but not to limit the claimed invention. 

20 EXAMPLES 
EXAMPLE 1 

1.1a Preparation of Interferon alpha-2/3-GalNAc (pH 6.2) 

[0469] Interferon alpha~2|3 was reconstituted by adding 200 jllL water to 4 mg of IFN 
alpha-2p. When the solid was dissolved, 1 .92 mL reaction buffer (20 mM MES, pH 6.2, 150 
25 mM NaCl, 5 mM MgCl 2 , 5 mM MnCl 2 , 0.05% polysorbate, and 0.05% NaN 3 ), was added. 
UDP-GalNAc (4.16 mg; 3 mM) and GalNAc T2 (80 mU; 80 jxL) were then added and the 
reaction mixture was incubated at 32 °C with slow rotary movement. The reaction was 
monitored using MALDI analysis and was essentially complete after 72 h 

. Once complete, the reaction mixture was submitted for peptide mapping, and analysis of 
30 site occupancy. 
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1.1b Preparation of Interferon alpha-2J3~GalNAc (pH 7.4). 

[0470] The interferon alpha 2p was reconstituted as described by the manufacturer. Water, 
50 uL, was added to 50 ug of IFN alpha-2p. When the solid was dissolved, the reaction 
buffer (20 mM MES, pH 7.4, 150 mM NaCl, 5 mM MgCl 2 , 5 mM MnCl 2 , 0.05% 
polysorbate, and 0.05% NaN 3 ), 50 uL was added. The UDP-GalNAc (100 ug; 3 mM) and 
GalNAc T2 (8 mU; 8 uL) were then added and the reaction mixture incubated at 32 °C under 
a slow rotary movement. The reaction was monitored using MALDI analysis and was found 
to be complete within about 48 to 72 h 



1.2 Preparation ofInterferon-alpha-2p-GalNAc-SA-PEG-20kilodalton using CMP- 
SA-PEG and ST6GalNAcI 

[0471] The IFN-alpha-2p-GalNAc (1 .0 mL, ~2 mg, 0. 1 umole) from /. 1 (above) was 
buffer exchanged (2x) using a 5 kilodalton MWCO Filter Centricon cartridge and a second 
buffer (20 mM MES, pH 7.4, 150 mM NaCl, 5 mM MgCl 2 , 5 mM MnCl 2 , 0.05% 
polysorbate, and 0.05% NaN 3 ). The IFN-alpha-2p-GalNAc was reconstituted from the spin 
cartridge using the second buffer, 1.0 mL, and both CMP-SA-PEG-20kilodalton (10 mg, 0.5 
micromoles) and ST6GalNAcl (200 uL) were added to the reaction mixture. The reaction 
was incubated at 32 °C for 96 h with slow rotary movement. The product, IFN-alpha-2p- 
GalNAc-SA-PEG-20kilodalton was purified using SP Sepharose and SEC (Superdex 75) 
chromatography. The addition of sialic acid-PEG was verified using MALDI analysis . 

1.3. Preparation ofInterferon-alpha-2/3-GalNAc-Gal-SA-PEG-20kilodalton using 

T 

CMP-SA-PEG, core-l-pi,3-galactosyl-transf erase, andST3Gal2 

[0472] The IFN-alpha-2p-GalNAc (1.0 mL, ~2 mg, 0. 1 umole) from the addition of 
GalNAc described above (pH 6.2) was buffer exchanged (2x) using a 5 kilodalton MWCO 
Filter Centricon cartridge and a second buffer (20 mM MES, pH 7.4, 150 mM NaCl, 5 mM 
MgCl 2 , 5 mM MnCl 2 , 0.05% polysorbate, and 0.05% NaN 3 ). The IFN-alpha-2p-GalNAc was 
reconstituted from the spin cartridge using 1.0 mL of the second buffer, containing CMP-SA- 
PEG-20kilodalton (10 mg, 0.5 micromoles), UDP-Galactose (1.8 mg, 3 mM), core-l-pi,3- 
galactosyl-transferase (200 mU on resin) and ST3Gal2 (200 mU, a2,3~(0)-sialytransferase). 
The reaction mixture was incubated at 32 °C for 96 h with slow rotary movement. The 
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product, IFN-alpha-2p-GalNAc-Gal-SA-PEG-20kilodalton 3 was purified by SP Sepharose 
and SEC (Superdex 75) chromatography. The addition of sialic acid-PEG was verified using 
MALDI analysis. 

1.9 Protein Concentration Assay 

5 [0473] Protein concentration was determined using a spectrophotometer at a fixed 

absorbance of 280 nm with 1 cm path length of cell. Triplicate readings were measured for a 
tested sample with water and buffer as controls. Protein concentration was determined using 
extinction coefficient at 0.799 mL/mg protein. 

1.10 Formulation of Final Product 

1 0 [0474] The formulation buffer contained pyrogen-free PBS, pH 6.5, 2.5% mannitol, and 
0.05% Polysorbate 80 that was degassed by vacuum and sterile filtered (0.2 |nm). 

[0475] Endotoxin was removed using a Detoxi-Gel™ equilibrated with 5 column beds of 
the formulation buffer (PBS, pH 6.5, 2.5% mannitol, and 0.05% Polysorbate 80). The flow 
rate was controlled by gravity at - 0.3 mL/min. Product samples were applied onto the gel, 
1 5 and the product eluted using the formulation buffer. The volume of the collected product was 
adjusted with additional formulation buffer to provide a protein concentration of about 100 
p,g/mL. 

[0476] The peptide formulations were sterile filtered (0.2 jli) and the effluent was dispensed 
as 1 mL aliquots into 2.0 mL pyrogen-free vials. In addition, aliquots were taken for 
20 endotoxin and protein analysis. All products were stored at 4 °C. 

1.13 Pharmacokinetic Study 

[0477] The pharmacokinetic analysis was performed using radioiodinated protein. After 
administration of the labeled interferons by IV tail vein injections into the rats, the clearance 
rate was measured as the reduction in radioactivity in blood drawn at specific intervals over 
25 72 h. Each time point is a measure of at least five rats. 

1.14 Results 

[0478] The reaction rate of GalNAc-T2 was measured at two pH's, a neutral pH (7.4) and a 
slightly acidic pH (6.2). Glycosylation with GalNAc proceeded sucessfully at both pH 6.2 
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and pH 7.4. As can be seen in the MALDI analysis of the reaction progress, the reaction rate 
was faster at pH 7.4 than at pH 6.2. 

[0479] GalNAc-T2 and GalNAc were added to interferon alpha-2(3 quantitatively at either 
pH 6.2 or pH 7.4. The reaction was followed by MALDI. During the enzymatic reaction, a 
5 new interferon alpha mass ion formed (IFN-alpha-2b 19,281 Da and IFN-alpha-2p -GalNAc, 
19,485 Da). 

[0480] The product of the reaction at pH 6.2, IFN-alpha-2b-GalNAc, was submitted to 
analysis to determine the position of substitution of the GalNAc on the protein. Peptide 
mapping and site occupancy mapping were used for this purpose. Peptide mapping using 
1 0 TIC of LC-MS/MS and a GluC digest of IFN-alpha-2b produced a peptide fragment of mass 
1018.69. MS/MS peptide amino acid sequencing of the peptide mass ion of 1018.69 
containing the GalNAc indicated that sugar was attached to T 106 . 

[0481] The sialyl-PEGylation of IFN-alpha-2b-GalNAc was examined using ST6GalNAc- 1 
and CMP-SA-PEG-20 kilodalton. The reaction of IFN-alpha-2b -GalNAc produced the 
1 5 PEG-ylated protein, which was visible by SDS PAGE. In general, the reaction proceeded at 
32 °C for 96 h. The reaction was monitored by SDS PAGE. SDS PAGE indicated that about 
70% of the IFN-alpha-2b -GalNAc was converted to IFN-alpha-2b -GalNAc-SA-PEG-20 
kilodalton. The MALDI analysis of the new band indicated a mass ion of 41,500 Daltons, the 
mass of IFN-alpha-2b -GalNAc-SA-PEG-20 kilodalton. 

i 

20 [0482] The glycoform of PEG-ylated interferon alpha-2b containing the GalNAc-Gal-S A- 
PEG structure was also produced. The reaction was performed using the conditions 
described above. The desired product was detected by SDS PAGE. A one pot, two-step 
reaction was used to produce the desired product, beginning with IFN-alpha-2p -GalNAc with 
core-l-p3-galactosyltransferase-l, ST3Gal2, UDP-galactose and CMP-SA-PEG-20 

25 kilodalton. The reaction was incubated at 32 °C for 96 h. The reaction was monitored by 
SDS PAGE. After 24 h, the reaction was about 70% complete. The MALDI of the product 
indicated a mass ion of 41,900 Da, which originates from the desired IFN-alpha-2p-GalNAc- 
Gal-SA-PEG-20 kilodalton product. 

[0483] Both glycoforms of the PEG-ylated interferon alpha~2b products were purified 
30 using a two-step process. In the first step, ion-exchange chromatography was performed 
using SP Sepharose. This procedure removed unreacted PEG materials and provided some 
separation of other proteins. The ion exchange step was followed by separation on SEC. A 
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Superdex 75 column was used to remove remaining smaller proteins including the 
glycosyltransferases and unPEG-ylated interferon alpha. Both PEG-ylated glycoforms of 
interferon alpha were purified to greater than 90% as shown by SDS PAGE). 

[0484] The antiviral data indicates that PEG-ylated glycoforms A and B retain their 
5 antiviral effects). 

[0485] The radioiodinated PEG-ylated proteins were injected into rats via their tail veins, 
the AUG for both proteins was 5-7 fold greater un-PEG-ylated interferon alpha-2p. 

[0486] Glycoform A (IFN-alpha-2P-GalNAc-SA-PEG-20kilodalton) and B (IFN-alpha-2p- 
GalNAc-Gal-SA-PEG-20kilodalton) were both bioactive. 

10 EXAMPLE 2 

2.1 Preparation of G-CSF-GalNAc (pH 6.2) 

• [0487] 960 ug of G-CSF in 3 .2 mL of buffer was concentrated by utrafiltration using a UF 

l 

filter (5 kilodalton) and reconstituted with 1 mL of 25 mM MES buffer (pH 6.2, 0.005% 
NaN 3 ). UDP-GalNAc (6 mg, 9.24 mM), GalNAc-T2 (40 nL, 0.04 U), and 100 mM MnCl 2 

15 (40 jliL, 4 mM) were then added and the resulting solution was incubated at room temperature 
for 48 hours. After 48 hours, MALDI indicated the reaction was complete (shift of the mass 
ion from 18800 to 19023 mass units). The reaction mixture was purified by HPLC using 
SEC (Superdex 75 and Superdex 200). The column was eluted using phosphate buffered 
saline, pH 4.9 and 0.005% Tween 80. The peak corresponding to G-CSF-GalNAc was 

20 collected and concentrated to about 150 \iL using a Centricon 5 kilodalton filter and the 
volume was adjusted to 1 mL using PBS (phosphate buffered saline, pH 4.9 and 0.005% 
Tween 80); protein concentration was 1 mg/mL A280). 

2.2 Preparation of G-CSF-GalNAc-Gal (pH 6.0) 

[0488] G-CSF-GalNAc (100 ug) was added to a 100 uL of a solution containing 25 mM 
25 MES buffer, pH 6.0, 1 .5 mM UDP-GalNAc, 1 0 mM MgCl 2 and 80 mU GalNAc-T2. The 
CMP-SA-PEG-20 kilodalton (0.5 mg, 0.025 umole), UDP-galactose 75 ug (0.125 umole), 
core-l-Gal-T 20 uL (10 mU) were then added and the solution which was slowly rocked at 
32 °C for 24 hours. MALDI indicated complete conversion of G-CSF-GalNAc into G-CCSF- 
GalNAc-Gal. 
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2.3 Preparation of G-CSF-GalNAc-SA-PEG-20 kilodalton (C). 

2.3a Sequential Process (pH 6.2). 

[0489] A G-CSF-GalNAc solution containing 1 mg of protein was buffer exchanged into 
25 mM MES buffer (pH 6.2, 0.005% NaN 3 ) then 5 mg, (0.25 umole) CMP-SA-PEG 
(20kilodalton) was added. Finally, 100 uL, of a 100 mM MnCl 2 solution and ST6GalNAc-I 
(100 uL) were added and the reaction mixture was rocked slowly at 32 °C. Aliquots were 
taken at time points (24, 48 and 72 h) and analyzed by SDS-PAGE. After 24 h, no further 
reaction was observed. The reaction mixture was concentrated by spin filtration (5 
kilodalton), buffer exchanged against 25 mM NaOAc (pH 4.9) and concentrated to 1 mL. 
The product was purified using ion exchange (SP-Sepharose, 25 mM NaOAc, pH 4.9) and 
SEC (Superdex 75; PBS-pH 7.2, 0.005% tween 80, 1 ml/min). The desired fraction was 
collected, concentrated to 0.5 mL and stored at 4 °C. 

2.3b One Pot process using ST6GalNAc-I (pH 6.0) 

[0490] 960 ug of G-CSF protein dissolved in 3 .2 mL of product formulation buffer was 
concentrated by spin filtration (5 kilodalton) to 0.5 mL and reconstituted in 25 mM MES 
buffer (pH 6.0, 0.005% NaN 3 ) to a total volume of about 1 mL, or a protein concentration of 
1 mg/mL. Following reconstitution UDP-GalNAc (6 mg, 9.21 umol), GalNAc-T2 (80 uL, 80 
mU), CMP-SA-PEG-20 kilodalton (6 mg, 0,3 umol ) and mouse enzyme ST6GalNAc-I (120 
uL)were added. The solution was rocked at 32 °C for 48 hours. Following the reaction the 
productwas purified using standard chromatography conditions on SP-Sepharose and SEC as 
described above. A total of 0.5 mg of protein (A 280 ) was obtained, for about a 50% overall 
yield. The product structure was confirmed by analysis with both MALDI and SDS-PAGE 

2.4 Preparation of G-CSF-GalNAc-Gal-SA-PEG-20 kilodalton (D) 

2.4a Starting from G-CSF-GalNAc 

[0491] UDP-galactose (4 mg, 6.5 umole), core- 1 -Gal-Ti (320 uL, 160 mU), CMP-SA- 
PEG-20 kilodalton (8 mg, 0.4 umole), ST3Gal2 (80 uL, 0.07 mU) and 80 uL of 100 mM 
MnCl 2 were directly added to the crude 1.5 mL of reaction mixture of the G-CSF-GalNAc 
(1.5 mg) in 25 mM MES buffer (pH 6.0) from Example 2.1 (above). The resulting mixture 
was incubated at 32 °C for 60 hours, however, the reaction was complete after 24 h. The 
reaction mixture was centrifuged and the solution was concentrated to 0.2 mL using 
ultrafiltration (5 kilodalton) and then redissolved in 25 mM NaOAc (pH 4.5) to a final 
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volume of 1 mL. The product was purified using SP-Sepharose, the peak fractions were 
concentrated using a spin filter (5 kilodalton) and the residue purified further using SEC 
(Superdex 75). After concentration using a spin filter (5 kilodalton), the protein was diluted 
to 1 mL using formulation buffer consisting of PBS, 2.5% mannitol, 0.005% polysorbate, pH 
5 6.5, and formulated at a protein concentration of 850 jag protein per mL (A 28 o). The overall 
yield was 55%. The MALDI analysis is shown in FIG 28. 

2.4b Starting from G-CSF 

[0492] 960 ug, of G-CSF (3 .2 mL) was concentrated by spin filter (5 kilodalton) and 
reconstituted with 25 mM MES buffer (pH 6.0, 0.005% NaN 3 ). The total volume of the G- 
10 CSF solution was adjusted to about 1 mg/mL and UDP-GalNAc (6 mg), GalNAc-T2 (80 uL), 
UDP-galactose (6 mg), core-l-Gal-Ti (160 uL, 80 uU), CMP-SA-PEG (20 kilodalton) (6 
mg), ST3Gal-2 (160 uL, 120 pU) and MnCl 2 (40 uL of a 100 mM solution) were added. The 
resulting mixture was incubated at 32 °C for 48 h. 

2.5 SP Sepharose HPLC Chromatography 

1 5 [0493] The SP Sepharose was performed as described in Example 1 .4. 

2.6 Size Exclusion Chromatography 

[0494] SEC was performed as described in Example 1.5. The purified samples were stored 
at 4 °C. 

2.6a Hydrophobic interaction chromatography (HIC) 

20 Follwing the first step of chromatographic chromatography HIC can be used as a second 

purification step to remove contaminants other thn un-Pegylated G-CSF. Thus, a method is 
available for the purification of glycopegylated G-CSF that has been through an initial 
purificatio on a gel permeation column. 

2.7 SDS PAGE Analysis 

25 [0495] The SDS PAGE was performed as set forth in Example 1 .6. 

2.8 MALDI Analysis 

[0496] MALDI analysis was performed as described in Example 1 .7. 
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2.9 Peptide Mapping Analysis 



[0497] 



Protein mapping analysis was performed as illu°trate^in Ex^mpic-f .8 



2.10 



Protein Concentration Assay 



[0498] 



Protein concentration was determined as described in Example 1.9. 



5 



2.11 



Product Formulation 



[0499] The product was formulated as set forth in Example 1.10 

2.12 Endotoxin Determination 

[0500] Endotoxin was determined as set forth in Example 1.11. 

2.13 Cell proliferation assay 

10 [0501] A G-CSF proliferation assay with a NFS-60 cell line and a Tf-1 cell line were 
performed according to standard procedures. The cells were plated into a 96 well plate at 
25000 cell/ml in the presence of different concentrations of G-CSF (51 nM, 25.5 nM 5 12.75 
nM, 3.2 nM, 1.6 nM, 0.8 nM, 0 nM), a chemically PEG-ylated G-CSF analogue, and 
PEGylated G-CSF C from Example 2.3 (above), and PEGylated G-CSF D from Example 2.4 

15 (above). The cells were incubated at 37 °C for 48 hours. A colorimetric MTT assay was 
used to determine the cell viability. 

i 

2. 14 In Vivo Activity: White Blood Cell (WBC) Production in the Rat 

[0502] Two doses of drug (50 ug/kg, 250 ug/kg) were examined for each of C, G-CSF and 
a chemically PEG-ylated G-CSF using mice. Blood was drawn at time points of 2 hour, 12 
20 hour, 24 hour, 36 hour, 48 hour, 60 hour, 72 hour, 84 hour and 96 hour, and the WBC and 
neutrophil counts were measured (FIG. 4). 

2.15 Accelerated Stability Study 

[0503] An accelerated stability study of PEGylated G-CSF, C, from Example 2.3, and 
PEGylated G-CSF, D, from Example 2.4 was performed using a buffer at pH 8.0 heated to 40 
25 °C. 72 \xg of PEGylated G-CSF C, was diluted to 8 mL with formulation buffer (PBS, 2.5% 
mannitol, 0.005% polysorbate 80). 1 mg of PEGylated G-CSF D, was diluted with 16 mL of 
formulation buffer. Both solutions were adjusted to pH 8.0 with NaOH and the resulting 
solution was sterile filtered into pyrogen-free tubes. The samples were slowly rotated at 40 
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°C and aliquots (0.8 mL) were taken at timepoints of 0 hour, 72 hours and 168 hours. 
Analysis was performed using SEC (Superdex 200) as described above (FIG. 6 and FIG. 7). 



2.16 Protein Radiolabeling 

[0504] G-CSF was radiolabeled using the Bolton Hunter reagent. This reaction was 
5 performed at pH 7.4 for 15 minutes and was followed by a SEC (Superdex 200) purification. 
Once purified, the formulation buffer pH was adjusted to 5.0 and the protein concentration 
was determined by A280. 

2.17 ELISA Assay 

[0505] An Elisa assay was utilized to quantify the G-CSF derivatives in rat plasma. The 
10 pharmacokinetic results are shown in FIG. 9. 

2.18 Pharmacokinetic Study 

[0506] Two pharmacokinetic studies were performed. For the first pharmacokinetic study 
proteins were radiolabeled and administered by IV tail vein injections into rats. Clearance 
rate was measured as the reduction in radioactivity in blood drawn at specific intervals over 
15 48 hours. Each time point was a measure of at least five rats. 

[0507] Specifically, 1 0 jig of G-CSF derivative was injected per animal (-1 \ig of labeled 
protein and 9 |ng of unlabeled protein). In addition to the blood being drawn and counted as 
described above, plasma was also collected and the protein acid was precipitated. The 
protein pellets were then also counted for radioactivity. The data from these studies is shown 
20 is FIG. 2, FIG. 3 and FIG. 8. 

[0508] In the second pharmacokinetic study the unlabeled G-CSF derivatives (30 \ig per 
animal) were administered by IV tail vein injections into rats. Blood samples were drawn at 
the time points indicated and the samples analzed by the G-CSF ELISA assay. The data is 
shown in FIG. 9. 

25 2.19 Results 

[0509] Human GalNAc T2 transferred GalNAc to G-CSF expressed in E. coli \ using UDP- 
GalNAc as the donor. Depending on the pH of the reaction buffer, one or two GalNAc 
moities were added to G-CSF as determined by MALDI. Addition of the second GalNAc 
proceeded slowly amounting to about 10-15% of the total product. One GalNAc could be 
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selectively added to G-CSF, in conversion yields of over 90%, by adjusting the pH of the 
reaction solution to 6.0-6.2. Addition of the second GalNAc occurred when the reaction was 
performed at a pH between about 7.2 and 7.4. Both Co +2 and Mn 4 " 2 are useful divalent metal 
ions in the reaction. Peptide mapping of the reaction products indicated that the predominant 
5 product of the reaction was addition of GalNAc to threonine- 133, the natural site of O-linked 
glycosylation in mammalian systems \. The second GalNAc was observed in the amino 
terminal peptide fragment of G-CSF and is postulated to occur at threonine-2. 

[0510] The reaction of G-CSF-GalNAc with ST6GalNAc-l (chicken or mouse) and CMP- 
SA-PEG-20 kilodalton provided the product G~CSF-GalNAc-SA-PEG-20 kilodalton, which 
10 was verified by MALDI\ 5 with conversion yields of about 50% as determined by SDS-PAGE 
V The G-CSF-GalNAc could also be further elongated using core-l-Gal-T and UDP- 

i 

galactose to provide complete conversion to G-CSF-GalNAc-Gal\. Glyco-PEG-ylation of 
this intermediate with ST3Gal2 and CMP-SA-PEG-20 kilodalton then provided the product 
G-CSF-GalNAc-Gal~SA-PEG~20 kilodalton in overall yields of about 50% \. These 
1 5 reactions were performed either sequentially in one pot or simultaneously in one pot starting 
from G-CSF or its glycosylated intermediates. In these studies, little or no difference was 
observed in overall yield by using either approach. 

[0511] The products of the glycosylation or glyco-PEG-ylation reactions were purified 
using a combination of ion exchange and SEC. The ion exchange step removes the unreacted 

20 G-CSF or its glycosylated intermediates (GalNAc or GalNAc-Gal) as well as any unreacted 
CMP-SA-PEG-20 kilodaltonV The SEC step removed remaining unreacted G-CSF and other 
protein contaminants from the glycosyltransferases used in the processV The G-CSF's 
containing the GalNAc-SA-PEG-20 kilodalton or the GalNAc-Gal-SA-PEG-20 kilodalton 
had identical properties and retention times using these purification methods. The final 

25 products had typical profiles as shown in. 

[0512] Once purified, the PEG-ylated proteins were formulated in a PBS buffer containing 
2.5% mannitol and 0.005% Tween 80. Initially, pH 6.5 was used in the formulation but 
aggregation of the glyco-PEG-ylated protein was a concern (see below) so the formulation 
buffer pH was lowered to 5.0. Literature reports have indicated that G-CSF aggregation is 
30 prevented by maintaining a solution pH between 4-5. Endotoxin was removed using an 
endotoxin removal cartridge using sterile technique. Protein concentrations were typically 
adjusted to concentrations between 100 jig/mL to 1 mg/mL as required for biological studies. 
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Endotoxin calculations were typically below 3EU/ml by this process. The formulated 
products are stored at 4. 

[0513] The products were tested in an in vitro cell proliferation assay using NSF-60 cells 
sensitive to G-CSF. It was observed that both the GalNAc-SA-PEG-20 kilodalton and 
5 GalNAc-Gal-SA-20 kilodalton products were effective at initiating cell proliferation (FIG. 

1). 

[0514] An accelerated stability study was performed on a chemically PEG-ylate G-CSF and 
C (G-CSF-GalNAc-SA-PEG-20 kilodalton). The formulation buffer pH was adjusted to 8.0 
and the temperature was raised to 40 °C. Samples were taken of each protein at times 0, 72 
10 and 168 h (FIG. 6and FIG. 7). Chemically PEG-ylated G-CSF was observed to aggregate 
entirely under these conditions within 1 68 h. SEC using a Superdex 200 chromatography 
was used to separate the aggregates. Although the glycoconjugate G-CSF-GalNAc-SA-PEG- 
20 kilodalton also formed aggregates that were separable using SEC, the aggregation 
occurred at a much slower rate. 

15 [0515] The glyco-PEG-ylated G-CSF was radioiodinated using the Bolton Hunter reagent. 

A cold labeling study was also performed prior to the actual radiolabeling to determine the 

extent of aggregation and to establish a methodology for removing any aggregates formed. 

Use of the Bolton Hunter reagent (cold) did provide some aggregates as shown in FIG. 5. 

SEC using a Superdex 200 column removed the aggregates and provided the monomeric, 
20 labeled material. Similar results were obtained using I labeled reagent. The use of the 

formulation minimized aggregation on storage. Protein content was measured by measuring 

the absorbance at A280. 

[0516] The results of the rat pK study incorporating G-CSF, chemically PEG-ylated G-CSF 
and the PEG-G-CSF conjugate labeled with the Bolton Hunter reagent are shown in FIG. 3. 
25 In this study, blood and protein precipitated from plasma were counted for radioactivity after 
IV administration of 10 \xg of G-CSF conjugate per rat. The data from both blood and plasma 
protein clearly indicate that the PEG conjugate and Chemically PEG-ylated G-CSF have 
identical clearance rates (FIG. 3 and FIG. 8). 

[0517] The ability of the G-CSF derivatives to initiate WBC production was then examined 
30 in a mouse model. Each test compound was injected IV as a single bolus and the induction of 
WBC and neutrophils was monitored over time. Chemically PEG-ylated G-CSF was the 
most potent protein tested when administered at 250 |ag/kg. The PEG conjugate (G-CSF- 
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GalNAc-SA-PEG-20 kilodalton) induced WBC production to almost the same degree as 
Chemically PEG-ylated G-CSF at 250 |ig/kg, and far greater than G-CSF at a similar 
concentration. 

EXAMPLE 3 

5 [0518] This example discloses amino acid sequence mutations that introduce changes 
introduce O-linked glycosylation sites, I e. , serine or threonine residues, into a preferably 
proline-containing site in the 175 amino acid wild-type sequence of G-CSF or any modified 
version thereof. As a reference the 1 75 amino acid wild-type G-CSF sequence is shown 
below: 

1 0 MTPLGPASSLP QSFLLKCLEQ VRKIQGDGAA LQEKLCA 

TYKLCHPEEL VLLGHSLGIP WAPLSSCPSQ ALQLAGCLSQ 
LHSGLFLYQG LLQALEGISP ELGPTLDTLQ LDVADFATTI 
WQQMEELGMA PALQPTQGAM PAFASAFQRR AGGVLVASHL 
QSFLEVSYRV LRHLAQP (SEQ ID NO:2) 

15 3.1 N-terminal Mutations 

[0519] In the N-terminal mutants, the N-terminus of a wild-type G-CSF, M ! TPLGPA (SEQ 
ID NO:), is replaced with either M^TPLGPA or M^oPZ^XnTPLGPA. Wherein n, o and m 
are integers sleeted from 0 to 3, and at least one of X, B and O is Thr or Ser. When more 
than one of X, B and O is Thr or Ser, the identity of these moieties is independently selected. 

20 Where they appear, superscripts denote the position of the amino acid in the wild-type 
starting sequence. 

[0520] Preferred examples include: 

MVTPI/GPA (SEQ ID NO:) 

M 1 QTPL 4 GPA (SEQ ID NO:) 
25 M 1 ATPL 4 GP A (SEQ ID NO :) 

M 1 PTQGAMPL 4 GPA (SEQ ID NO:) 

M 1 VQTPL 4 GPA (SEQ ID NO:) 

M 1 QSTPL 4 GPA (SEQ ID NO:) 

M 1 GQTPL 4 GPA (SEQ ID NO:) 
30 M^APTSSSPL^PA (SEQ ID NO:) 
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M^PTPI/GPA (SEQ ID NO:) 

i 

3.2 Internal Mutation Site 1 

[0521] In these mutants, the N-terminus of a wild-type GCSF, M^PLGP (SEQ ID NO : 8), 
is replaced with M^TPXnBoOfP. Wherein n, o and r are integers sleeted from 0 to 3, and at 
least one of X, B and O is Thr or Ser. When more than one of X, B and O is Thr or Ser, the 
identity of these moieties is independently selected. Where they appear, superscripts denote 
the position of the amino acid in the wild-type starting sequence. 

[0522] Preferred mutations include: 

M 1 TPTLGP (SEQ ID NO:8) 
M 1 TPTQLGP (SEQ ID NO:8) 
M^TPTSLGP (SEQ ID NO:8) 
M 1 TPTQGP (SEQ ID NO:8) 
M^PTSSP (SEQ IDNO:8) 
M^PQTP (SEQ ID NO:8) 
M'TPTGP (SEQ ID NO: 8) 
M i TPLTP (SEQ ID NO: 8) 

M'TPNTGP (SEQ ED NO:8) 
M i TpvTp (SE q ID NO . 8) 

M'TPMVTP (SEQ ID NO: 8) 

MT lp2 TQGL 3 G 4p5 A 6 s 7 (ggQ j D NO;8) 

3.3 Internal Mutation Site 2 

[0523] This mutation is made for the purpose of maintaining G-CSF activity. In these 
mutants, the amino acid sequence containing H 53 , LGH 53 SLGI (SEQ ID NO: ) is mutated to 
LGH 53 B 0 LGI, where 0 is H, S, R,E or Y, and B is either Thr or Ser. 

[0524] Preferred examples include: 

LGHTLGI 

LGSSLGI 

LGYSLGI 

LGESLGI 

LGSTLGI 
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3. 4 Internal Mutation Site 3 

[0525] In this type of mutant, the amino acid sequence encompassing P 129 , P 129 ALQPT 
(SEQ ID NO: ), is mutated to P 129 Z m J q O r X„PT, wherein Z, J, O and X are independently 
selected from Thr or Ser, and m, q, r, and n are integers sleeted from 0 to 3. 

5 [0526] Preferred examples include: 

P 129 TLGPT 
P 129 TQGPT 
P 129 TSSPT 
P 129 TQGAPT 
10 P 129 NTGPT 

P 129 ALTPT 
P 129 MVTPT 
P 129 ASSTPT 

pl29 TT Qp 

15 P 129 NTLP 

P 129 TLQP 

MAP 129 ATQPTQGAM 
MP 129 ATTQPTQGAM 

3. 5 Internal Mutation Site 4 

20 [0527] In this type of mutant, the amino acid sequence surrounding P 61 , LGIPWAP 61 LSSC 
(SEQ ID NO:), is replaced with PZ m U s J q P 01 O r X„B 0 C, wherein m ? s 9 q, r, n, and o are integers 
sleeted from 0 to 3, and at least one of Z, J, O, X ? B and U is selected as either Thr or Ser. 
When more than one of Z 5 J 5 OX,B and U is Thr or Ser, each is independently selected 

[0528] Preferred examples include: 

25 P 61 TSSC . 

P 61 TSSAC 

LGIPTAP 61 LSSC 

LGIPTQ P 61 LSSC 

LGIPTQG P 61 LSSC 
30 LGIPQT P 61 LSSC 
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LGIPTS P 61 LSSC 
LGIPTS P 61 LSSC 
LGIPTQP 61 LSSC 
LGTPWAP 61 LSSC 
LGTPFA P 61 LSSC 

p61pyp 

SLGAP 58 TAP 61 LSS 

t 

3. 6 C -terminal Mutations 

[0529] In this type of mutant, the amino acid sequence at the C-terminus of a wild-type G- 
CSF, RHLAQP 175 (SEQ ID NO: ) is replaced with 0 a G p J q O r P 175 X„B o Z m U s Y t , wherein a, p, q : 
r, n, o, m, s, and t are integers sleeted from 0 to 3, and at least one of Z, U, O, J, G, 0, B and 
X is Thr or Ser and when more than one of Z, U, O, J, G, 0, B and X are Thr or Ser, they are 
independently selected. 0 is optionally R, and G is optionally H. The symbol T represents 
any uncharged amino acid residue or E (glutamate). 

[0530] Preferred examples include: 

RHLAQTP 175 

RHLAGQTP 175 

QP 175 TQGAMP 

RHLAQTP 1 75 AM 

QP 175 TSSAP 

QP 175 TSSAP 

QP 175 TQGAMP 

QP 175 TQGAM 

QP 175 TQGA 

QP 175 TVM 
Q P i75 NTGp 

QP 175 QTLP 

3. 7 Internal Mutations surrounding P 133 

[0531] Additional G-CSF mutants include those with internal mutations surrounding the 
amino acid P 133 . Examples include: 
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P^TQTAMP"* 

P 133 TQGTMP 

P 133 TQGTNP 

P 133 TQGTLP 

PALQP 133 TQTAMPA 



EXAMPLE 4 

[0532] Mutations in the amino acid sequence of granulocyte colony stimulating factor (G- 
CSF) can introduce additional sites for O-linked glycosylation, such that the protein may be 
modified at these sites using the method of the present invention. This example sets forth 
selected representative mutants of the invention. 

4. 1 G-CSF (wild type 1 78 aa variant) 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklvseca tyklchpeel vllghslgip 
waplsscpsq alqlagclsq lhsglflyqg llqalegisp elgptldtlq ldvadfatti wqqmeelgma 
palqptqgam pafasafqrr aggvlvashl qsflevsyrv lrhlaqp (SEQ ID NO:l) 

4.2 G-CSF (wild type 175 aa variant) 

mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklca tyklchpeel vllghslgip waplsscpsq 
alqlagclsq lhsglflyqg llqalegisp elgptldtlq ldvadfatti wqqmeelgma palqptqgam 
pafasafqrr aggvlvashl qsflevsyrv lrhlaqp (SEQ ID NO:3) 

4.9 G-CSF Mutant 1 (Amino Terminal mutation) 

miatplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg ptldtlqldv 
adfattiwqq meelgmapal qptqgampaf asafqrragg vlvashlqsf 
levsyrvlrh laqp 

4J0G-CSF Mutant 2 (Amino Terminal mutation) 

mgvtetplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg ptldtlqldv 
adfattiwqq meelgmapal qptqgampaf asafqrragg vlvashlqsf levsyrvlrh 
laqp 
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4.11 G-CSF Mutant 3 (Amino Terminal mutation) 

maptplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll ghslgipwap 
lsscpsqalq lagclsqlhs glflyqgllq alegispelg ptldtlqldv adfattiwqq 
meelgmapal qptqgampaf asafqrragg vlvashlqsf levsyrvlrh laqp 

i 

4.12 G-CSF Mutant 4 (Site 1) 

mtp 3 tqglgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll 
ghslgipwap lsscpsqalq lagclsqlhs glflyqgllq alegispelg ptldtlqldv 
adfattiwqq meelgmapal qptqgampaf asafqrragg vlvashlqsf levsyrvlrh 
laqp 

4.13 G-CSF Mutant 5 (Site 3 ) 

Mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll ghslgipwap 
lsscpsqalq lagclsqlhs glflyqgllq alegispelg ptldtlqldv adfattiwqq 
meelgmap 129 at qptqgampaf asafqrragg vlvashlqsf levsyrvlrh laqp 

4.14 G-CSF Mutant 6 (Site 4) 

Mtplgpasslp qsfllkcleq vrkiqgdgaa lqeklcatyk lchpeelvll ghslgip 58 ftp 
lsscpsqalq lagclsqlhs glflyqgllq alegispelg ptldtlqldv adfattiwqq 
meelgmapaL qptqgampaf asafqrragg vlvashlqsf levsyrvlrh laqp 

EXAMPLE 5 

GlycoPEGylation of G-CSF produced in CHO cells 

5a. Preparation of Asialo-Granulocyte-Colony Stimulation Factor (G-CSF) 

[0533] G-CSF produced in CHO cells was dissolved at 2.5 mg/mL in 50 mM Tris 50 mM 
Tris-HCl pH 7.4, 0.15 M NaCl, 5 mM CaCl 2 and concentrated to 500 jj,L in a Centricon Plus 
20 centrifugal filter. The solution was incubated with 300 mU/mL Neuraminidase II (Vibrio 
cholerae) for 16 hours at 32 °C. To monitor the reaction a small aliquot of the reaction was 
diluted with the appropriate buffer and a IEF gel performed. The reaction mixture was then 
added to prewashed N-(p-aminophenyl)oxamic acid-agarose conjugate (800 jaL/mL reaction 
volume) and the washed beads gently rotated for 24 hours at 4 °C. The mixture was 
centrifuged at 10,000 rpm and the supernatant was collected. The beads were washed 3 times 
with Tris-EDTA buffer, once with 0.4 mL Tris-EDTA buffer and once with 0.2 mL of the 
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Tris-EDTA buffer and all supernatants were pooled. The supernatant was dialyzed at 4 °C 
against 50 mM Tris -HC1 pH 7.4, 1 M NaCl 5 0.05% NaN 3 and then twice more against 50 
mM Tris -HC1 pH 7.4, 1 M NaCl, 0.05% NaN 3 . The dialyzed solution was then concentrated 
using a Centricon Plus 20 centrifugal filter and stored at -20 °C. The conditions for the IEF 
gel were run according to the procedures and reagents provided by Invitrogen. Samples of 
native and desialylated G-CSF were dialyzed against water and analyzed by MALDI-TOF 
MS. 

5b. Preparation of G-CSF-(alpha2,3)-Sialyl~PEG 

[0534] Desialylated G-CSF was dissolved at 2.5 mg/mL in 50 mM Tris-HCl, 0. 1 5 M NaCl, 
0.05% NaN 3? pH 7.2. The solution was incubated with 1 mM CMP-sialic acid-PEG and 0.1 
U/mL of ST3Gall at 32°C for 2 days. To monitor the incorporation of sialic acid-PEG, a 
small aliquot of the reaction had CMP-SA-PEG-fluorescent ligand added; the label 
incorporated into the peptide was separated from the free label by gel filtration on a Toso 
Haas G3000SW analytical column using PBS buffer (pH 7.1). The fluorescent label 
incorporation into the peptide was quantitated using an in-line fluorescent detector. After 2 
days, the reaction mixture was purified using a Toso Haas G3000SW preparative column 
using PBS buffer (pH 7.1) and collecting fractions based on UV absorption. The product of 
the reaction was analyzed using SDS-PAGE and IEF analysis according to the procedures 
and reagents supplied by Invitrogen. Samples of native and PEGylated G-CSF were dialyzed 
against water and analyzed by MALDI-TOF MS. 

5c. Preparation of G-CSF-(alpha2,8)-Sialyl-PEG 

[0535] G-CSF produced in CHO cells, which contains an alpha 2,3-sialylated O-linked 
glycan, was dissolved at 2.5 mg/mL in 50 mM Tris-HCl, 0.15 M NaCl, 0.05% NaN 3 , pH 7.2. 
The solution was incubated with 1 mM CMP-sialic acid-PEG and 0.1 U/mL of CST-II at 
32°C for 2 days. To monitor the incorporation of sialic acid-PEG, a small aliquot of the 
reaction has CMP-SA-PEG-fluorescent ligand added; the label incorporated into the peptide 
was separated from the free label by gel filtration on a Toso Haas G3000SW analytical 
column using PBS buffer (pH 7.1). The fluorescent label incorporation into the peptide was 
quantitated using an in-line fluorescent detector. After 2 days, the reaction mixture was 
purified using a Toso Haas G3000SW preparative column using PBS buffer (pH 7.1) and 
collecting fractions based on UV absorption. The product of the reaction was analyzed using 

i 

SDS-PAGE and IEF analysis according to the procedures and reagents supplied by 
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Invitrogen. Samples of native and PEGylated G-CSF were dialyzed against water and 
analyzed by MALDI-TOF MS. 



5d. Preparation of G-CSF-(alpha 2,6)-Sialyl-PEG 

[0536] G-CSF ? containing only O-linked GalNAc, was dissolved at 2.5 mg/mL in 50 mM 
5 Tris-HCl, 0.15 M NaCl, 0.05% NaN 3 , pH 7.2. The solution was incubated with 1 mM CMP- 
sialic acid-PEG and 0.1 U/mL of ST6GalNAcI or II at 32°C for 2 days. To monitor the 
incorporation of sialic acid-PEG, a small aliquot of the reaction has CMP-S A-PEG- 
fluorescent ligand added; the label incorporated into the peptide was separated from the free 
label by gel filtration on a Toso Haas G3000SW analytical column using PBS buffer (pH 

10 7.1). The fluorescent label incorporation into the peptide was quantitated using an in-line 
fluorescent detector. After 2 days, the reaction mixture was purified using a Toso Haas 
G3000SW preparative column using PBS buffer (pH 7.1) and collecting fractions based on 
UV absorption. The product of the reaction was analyzed using SDS-PAGE and IEF analysis 
according to the procedures and reagents supplied by Invitrogen. Samples of native and 

1 5 PEGylated G-CSF were dialyzed against water and analyzed by MALDI-TOF MS. 

[0537] G-CSF produced in CHO cells was treated with Arthrobacter sialidase and was then 
purified by size exclusion on Superdex 75 and was treated with ST3Gall or ST3 Gal2 and 
then with CMP-SA-PEG 20Kda. The resulting molecule was purified by ion exchange and 
gel filtration and analysis by SDS PAGE demonstrated that the PEGylation was complete. 
20 This was the first demonstration of glycoPEGylation of an O-linked glycan. 



EXAMPLE 6 



Recombinant GCSF - Expression, refolding and purification 

• Harvest cells by centrifugation, discard supernatant. Results of growth on various 
media are shown in Figure 9. 

25 • Resuspend cell pellet in lOmM Tris pH7.4, 75mM NaCl, 5mM EDTA -use 

1 Oml/g (lysis buffer) 

• Microlluidize cells (French press works as well) 

• Centrifuge 3 Omin, 4°C at 5,000RPM-discard supernatant 

• Resuspend pellet in lysis buffer and centrifuge as above 
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• Wash IB ' s in 25mM Tris pH8, 1 OOmM NaCl, 1 %TX- 1 00, 1 % NaDOC, 5mM 
EDTA. Pellets are resuspended by pipetting and vortexing. Centrifuge 15min 4°C 
5,000RPM. Repeat this step once more (total of two washes) 

• Wash pellets two times in 25mM Tris pH8, 1 OOmM NaCl, 5mM EDTA to remove 
detergents, centrifuge as above 

• Resuspend pellets in dH20 to aliquot and centrifuge as above. Pellets are frozen at 
-20C 

• IB ' s are resuspended at 20mg/ml in 6M guanidineHCl, 5mM EDTA, 1 OOmM 
NaCl, lOOmM Tris pH8, lOmM DTT using a pipettor, followed by rotation for 2- 
4h at room temperature. 

• Centrifuge solubilized IB's for lmin at room temperature at 14,000RPM. Save 
supernatant. 

• Dilute supernatant 1 :20 with refold buffer 50mM MES pH6, 240mM NaCl, 
lOmM 

• KC1, 0.3mM lauryl maltoside, 0.055% PEG3350, ImM GSH, O.IM GSSG, 0.5M 
arginine and refold on rotator overnight at 4°C. 

• Transfer refold to Pierce snakeskin 7kDa MWCO for dialysis. Dialysis buffer 
20mM NaOAc pH4, 50mM NaCl, 0.005% Tween-80, O.lmM EDTA. Dialyze a 
total of 3 times versus at least a 200 fold excess at 4°C. 

• After dialysis pass material through a 0.45 uM filter. 

• Equlibrate SP-sepharose column with the dialysis buffer and apply sample. Wash 
column with dialysis buffer and elute with dialysis buffer containing a salt 
gradient up to 1M NaCl. Protein typically is eluted at 300-400mM NaCl. 

• Check material on SDS-PAGE (see e.g., Figure 10). 

EXAMPLE 7 

i 

The Two Enzvme Method in Two Pots 

[0538] The following example illustrates the preparation of G-CSF-GalNAc-S A-PEG in 
two sequential steps wherein each intermediate product is purified before it is used in the next 
step. 
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7a. Preparation of G-CSF-GalNAc (pH 6.2) from G-CSF and UDP-GalNAc 
using GalNAc-T2. 



[0539] G-CSF (960 ug) in 3.2 mL of packaged buffer was concentrated by utrafiltration 
using an UF filter (MWCO 5K) and then reconstituted with 1 mL of 25 mM MES buffer (pH 
6.2, 0.005% NaN 3 ). UDP-GalNAc (6 mg, 9.24 mM), GalNAc-T2 (40 uL, 0.04 U), and 100 

mM MnCl 2 (40 uL, 4 mM) were then added and the resulting solution was incubated at room 
temperature. 

[0540] After 24 hrs, MALDI indicated the reaction was complete. The reaction mixture 
was directly subjected to HPLC purification using SEC (Superdex 75 and Superdex 200) and 
an elution buffer comprising of PBS (phosphate buffered saline, pH 4.9 and 0.005% Tween 
80). The collected peak of G-CSF-GalNAc was concentrated using a Centricon 5 KDa 
MWCO filter to about 150 uL and the volume adjusted to 1ml using PBS (phosphate 
buffered saline, pH 4.9 and 0.005% Tween 80). Final protein concentration 1 mg/mL (A 280 ), 
yield 1 00%. The sample was stored at 4 °C. 

7b. Preparation of G-CSF-GalNAc-SA-PEG using purified G-CSF- 
GalNAc, CMP-SA-PEG (20KDa) and mouse ST6GalNAc-TI (pH 6.2). 

[0541] The G-CSF-GalNAc solution containing 1 mg of protein was buffer exchanged into 
25 mM MES buffer (pH 6.2, 0.005% NaN 3 ) and CMP-SA-PEG (20KDa) (5 mg, 0.25 umol) 
was added. After dissolving, MnCl 2 (100 mcL, 100 mM solution) and ST6GalNAc-I (100 
mcL, mouse enzyme) was added and the reaction mixture rocked slowly at 32 °C for three 
days. The reaction mixture was concentrated by ultrifiltration (MWCO 5K) and buffer 
exchanged with 25 mM NaOAc (pH 4.9) one time and then concentrated to 1 mL of total 
volume. The product was then purified using SP-sepharose (A: 25 mM NaOAc+0.005% 
tween-80 pH 4.5; B: 25 mM NaOAc+0.005% tween-80 pH 4.5+2M NaCl) at retention time 
13—18 mins and SEC (Superdex 75; PBS-pH 7.2, 0.005% Tween 80) at retention time 8.6 
mins (superdex 75, flow 1 ml/min) The desired fractions were collected, concentrated to 0.5 
mL and stored at 4 °C. 
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EXAMPLE 8 



One Pot Method to Make G-CSF-GalNAc -SA-PEG with Simultaneous 
Addition of Enzymes 

[0542] The following example illustrates the preparation of G-CSF-GalNAc -SA-PEG in 
5 one pot using simultaneous addition of enzymes 



8a. One Pot process using mouse ST6GalNAc-I (pH 6.0). 

[0543] G-CSF (960 jag of protein dissolved in 3.2 mL of the product formulation buffer) 
was concentrated by ultrafiltration (MWCO 5K) to 0.5 ml and reconstituted with 25 mM 
MES buffer (pH 6.0, 0.005% NaN 3 ) to a total volume of about 1 mL or a protein 

10 concentration of 1 mg/mL. UDP-GalNAc (6 mg, 9.21 pmol), GalNAc-T2 (80 i^L, 80 mil), 
CMP-SA-PEG (20KDa) (6 mg, 0,3 jamol ) and mouse enzyme ST6GalNAc-I (120 jliL) and 
100 mM MnCl 2 (50 jjL) were then added. The solution was rocked at 32°C for 48 hrs and 
purified using standard chromatography conditions on SP-sepharose. A total of 0.5 mg of 
protein (A 28 o) was obtained or about a 50% overall yield. The product structure was 

1 5 confirmed by analysis with both MALDI and SDS-PAGE. 



8b. One pot process using chicken ST6GalNAc-I (pH 6. 0). 

[0544] 14.4 mg of G-CSF; was concentrated to 3 mL final volume, buffer exchanged with 
25 mM MES buffer (pH 6.0, 0.05% NaN 3 , 0.004% Tween 80) and the volume was adjusted 
to 13 mL. The UDP-GalNAc (90 mg, 150 (amole), GalNAc-T2 (0.59 U), CMP-SA-PEG- 

20 20KDa (90 mg), chicken ST6GalNAc-I (0.44 U), and 100 mM MnCl 2 (600 mcL) were then 
added. The resulting mixture stood at room temperature for 60 hrs. The reaction mixture was 
then concentrated using a UF (MWCO 5K) and centrifugation. The residue (about 2 mL) 
was dissolved in 25 mM NaOAc buffer (pH 4.5) and concentrated again to 5 mL final 
volume. This sample was purified using SP-sepharose for about 10-23 min, SEC (Superdex 

25 75, 17 min, flow rate 0.5 ml/min) and an additional SEC (Superdex 200, 23 min, flow rate 0.5 
ml/min), to yield 3.6 mg (25% overall yield) of G-CSF-GalNAc-S A-PEG-20 KDa (A 280 and 
BCA method). 
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EXAMPLE 9 

One Pot Method to Make G-CSF-GalNAc-Gal-SA-PEG with Sequential Addition of 
Enzymes 

[0545] The following example illustrates a method for making G-CSF-GalNAc-Gal-SA- 
5 PEG in one pot with sequential addition of enzymes. 

9.1 Starting from GalNAc-G-CSF 

a. Preparation of G-CSF-GalNAc (pH 6.2) from G-CSF and UDP-GalNAc 
using GalNAc-T2. 

[0546] G-CSF (960 meg) in 3.2 mL of packaged buffer was concentrated by utrafiltration 
10 using an UF filter (MWCO 5K) and then reconstituted with 1 mL of 25 mM MES buffer (pH 
6.2, 0.005% NaN 3 ). UDP-GalNAc (6 mg, 9.24 mM), GalNAc-T2 (40 \xL, 0.04 U), and 100 

mM MnCl 2 (40 |uL, 4 mM) were then added and the resulting solution was incubated at room 
temperature. 

b. Preparation of G-CSF-GalNAc-Gal-SA-PEG from G-CSF-GalNAc ; UDP- 
1 5 Galactose, SA-PEG-20Kdalton, and the Appropriate Enzymes 

[0547] The UDP-Galactose (4 mg, 6.5 jumoles ), core-l-Gal-T (320 liL, 160 mU), CMP- 
SA-PEG-20KDa (8 mg, 0.4 jumole), ST3Gal2 (80 ^L, 0.07 mU) and 100 mM MnCl 2 ( 80 jaL) 
were directly added to the crude reaction mixture of the G-CSF-GalNAc (1 .5 mg) in 1 .5 ml 
25 mM MES buffer (pH 6.0) from step a, above. The resulting mixture was incubated at 

20 32°C for 60 hrs. The reaction mixture was centrifuged and the solution was concentrated 
using ultrafiltration (MWCO 5K) to 0.2 mL, and then redissolved with 25 mM NaOAc (pH 
4.5) to a final volume of 1 mL. The product was purified using SP-sepharose (retention time 
of between 10-15 min), the peak fraction were concentrated using a spin filter (MWCO 5K) 
and the residue purified further using SEC (Superdex 75, retention time of 10.2 min). After 

25 concentration using a spin filter (MWCO 5K), the protein was diluted to 1 mL using 

formulation buffer with PBS, 2.5% mannitol, 0.005% polysorbate, pH 6.5 and formulated at a 
protein concentration of 850 meg protein per mL (A 2 8o). The overall yield was 55%. 
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EXAMPLE 10 

One Pot Method to Make G-CSF-GalNAc-Gal-SA-PEG with 
Simultaneous Addition of Enzymes 

a. Starting from G-CSF. 

5 [0548] G-CSF (960 meg, 3 .2 ml) was concentrated by ultrafiltration (MWCO 5K) and 
reconstituted with 25 mM Mes buffer (pH 6.0, 0.005% NaN 3 ). The total volume of the G- 
CSF solution was about 1 mg/ml. UDP-GalNAc (6 mg), GalNAc-T2 ( 80 pL, -80 juU), UDP- 
Gal (6mg) 5 Corel GalT (160 |aL, 80 jaU), CMP-SA-PEG(20K) (6 mg) and a 2,3-(0)- 
sialyltransferase (160 |uL, 120 jaU) 3 100 mM MnCl2(40 j^L ) were added. The resulting 
10 mixture was incubated at 32°C for 48 h. Purification was performed as described below using 
IEX and SEC. The resulting fraction containing the product were concentrated using 
ultrafiltration (MWCO 5K) and the volume was adjusted to about 1 mL with buffer. The 
protein concentration was determined to be 0.392 mg/ml by A280, giving an overall yield of 
40% from G-CSF. 

15 EXAMPLE 11 

[0549] The following Example illustrates an alternative enzymatic method to obtain large 
quantities of GlycoPEGylated G-CSF. 

[0550] Granulocyte Colony Stimulating Factor (G-CSF) protein was expressed in E. coli 
and refolded from inclusion bodies as disclosed in Example X (above). 

20 11a. Priming the reaction by addition of GalNAc: 

[0551] GalNAc-ylation of G-CSF was carried out at 33 °C in 50 mM Bis-Tris pH 6.5 buffer 
containing 1 mM MnCl 2 using refolded GalNAcT2 in the presence UDP-GalNAc. This step 
primes the reaction enabling both GalNAc transferase and sialyltransferase to work together 
in subsequent steps to very efficiently produce maximum amount of GCSF-PEG in a short 
25 "period of time. 

lib. PEGylation process: 
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[0552] PEGylation was started 2 (+/- 1) hour after GalNAc-ylation by directly adding 
CMP-SA-PEG (20K) and ST6GalNAcI (chicken or human) to the priming reaction. This 
step produces substrate (GCSF-O-GalNAc) for the sialyltransferases to drive the reaction 
faster in a shorter period of time than can be achieved in a two step reaction wherein the 
5 GCSF-OGalNAc is first purified from the UDP-GalNAc and other reaction components (see 
e.g., Example X, above). Furthermore the primed one pot reaction produces a higher yield of 
product than does a one pot reaction in which all components are added simultaneously. 

[0553] Indeed, comparison of several types of one pot reactions shows that when all the 
components were added simultaneously and incubated for 23 hours, the GCSF-PEG produced 
10 was 77 %. In contrast, when addition of all the enzymes required for the PEGylation reaction 
was preceded by the 2 hr GalNAc-ylation step described above, product yield was 85 %. 
Therefore, the sequential addition of reaction components resulted in a 1 0 % higher yield 
than was obtained when all reaction components are added simultaneously. 

EXAMPLE 12 

1 5 [0554] This Example describes the results of Olinked GalNAc-ylation of six mutant G- 
CSF proteins. 

12.1. GalNAcsylation of mutant G-CSF protein: 

[0555] All the sequences of mutant G-CSF proteins are listed below. Having these 
proteins, 0-linked glycosylation was examined. Under the same condition for glycosylation 

20 of native G-CSF, GalNAc-T 2 (BV) was used in vitro with UDP-GalNAc in 25 mM MES 
buffer ( pH 6.0 ). MALDI was used to monitor the reaction. Measurement of increasing 
molecular weight of proteins provided GalNAc addition number. For one addition of 
GalNAc, increased molecular weight should be 203 Da. Based on MALDI results, we found . 
that mutant G-CSF-2, -3, -4, accepted one GalNAc; and mutant G-CSF-5 some addition was 

25 also observed, and mutant G-CSF- 1 accepted two GalNAcs, forming MAPT-G- 
CSF(GalNAc) 2 ( Molecular weight increasing from 18965 to 19369 Daltons). 



Table X. GalNAc addition of Mutant G-CSF ( MW measured by MALDI) 



Peptide 


MW(Intact material) 


MW (GalNAc- 


Number of GalNAc 
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adduct) 


addition 

i 


MutantG-CSF-l 
(MAPT-G-CSF) 


18965 


19369 


2 


MutantG-CSF-2 


18766 


19029 


1 


MutantG-CSF-3 


18822 


19026 


1 


MutantG-CSF-4 


19369 


19574 


1 


MutantG-CSF-5 


18957 


18853 


1 


MutantG-CSF-6 






NT 


Native G-CSF 


18800 


19023 


1 



[0556] Peptide mapping and N-terminal analysis were used for determination of 
glycosylation sites of MAPT-G-CSF~(GalNAc)?. In the Glu C-digested peptide mapping a G- 
1+GalNAc peak was found, indicating one GalNAc was added at G-l sequence. N-terminal 
5 Edman degdation analysis suggested the normal T was lost indicting that GalNAc was added 
onto T residue. 

12.2 GlycoPEGylation of mutant G-CSF sequences 

a. GlycoPEGylation of mutant G-CSF sequence and buffer impact on the 

glycoPEGylation of MAPT-G-CSF 

1 0 [0557] An examination of glycoPEGylation (20K) of 5 mutants was undertaken. 

GlycoPEGylation was performed using three enzyme/ three nucleotides system. (UDP- 
GalNAc/GalNAc-T 2 /UDP-Gal/Core GalT/CMP-SA-PEG/O-sialyltransferase) in 25 mM 
MES buffer (pH 6.0). All mutants can be monoglycoPEGylated. No appreciable 
diPEGYlation in this condition was detected by SDS-PAGE gel by Comassie Blue Stain. 

15 [0558] Since MAPT-G-CSF accept two GalNAcs, this mutant should receive two PEGs in 
theory. Accordingly, we examined the buffer impact on the PEGylation of MAPT-G-CSF as 
a starting material. Four different buffers ( 1. 1M MES buffer; 2. 25 mM MES buffer (pH 
6.0); 3. 50 mM Bis-tris buffer(pH 6.0); 4. 1M HEPS buffer (pH 7.4) were investigated for 
this reaction. It was found that MAPT-G-CSF can be PEGylated in all of the buffer system 

20 tested. However, monoPEGylation product was still a major one. In case 1M MES and 1 M 
HEPS buffer were used, some diPEGYylation product was formed, indicating that high 
concentration buffer improves the glycoPEGylation . 
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b. Comparison of GlycoPEGylation efficiency by forming MAPT-G-CSF(GaINAc- 
SA-PEG) 2 and MAPT-G-CSF(GaINAc-GaI-SA-PEG) 2 

[0559] In order to see glycoPEGylation efficiency of Muant G-CSF-1 catalyzed by different 
enzymes, two enzymes ( SteGalNAcI and O-siayltransferase) were examined for 
sialylPEGylation. Accordingly, MAPT-G-CSF was converted into MAPT-G-CSF(GalNAc) 2 
and MAPT-G-CSF(GalNAc-Gal) 2 for siaylPEGylation. The former was treated with CMP- 
SA-lys-PEG(20K)/ St6GalNAc I and the latter was treated with CMP-SA-PEG(20K)/O- 
sialyltransferase. Both reactions were performed in 25 mM MES buffer(pH 6.0) and lmg/ml 
protein concentration. The PEGylation efficiency can be seen in SDS-Page gel. It appeared 
that two enzymes were pretty similar in glycoPEGylation of this protein using CMP-SA-Lys- 
PEG (20KDa) under the condition tested. 

c. High protein concentration led to formation of MAPT-G-CSF((GalNAc-SA- 
PEG(20KDa)) 2 as a major product. 

[0560] After examining the impact of enzyme and buffer on glycoPEGylation, as described 
above, the influence of protein concentration on the PEGylation by combining with a factor 
of high buffer concentration using ST 6 GalNAcI as GlycoPEGylation enzyme. So we applied 
UDP-GalNAc/GalNAc-T 2 and CMP-SA-PEG(20KDa)/St6GalNAcI for glycoPEGylation of 
MAPT-G-CSF using 8-10 mg/ml protein concentration for reaction in 1 M MES buffer(pH 
6.0). The result suggested that under this condition, the desired diPEGylation product 
became the major. Over 90% conversion was also achieved by applying more CMP-SA-PEG 
(20K) and enzyme. PEGylated G-CSF product, MAPT-G-CSF((GalNAc-SA-PEG(20KDa)) 2 
was purified by combining SP-Sepharose and SEC purification on Supderdex 200. 

12.3. Cell proliferation activity ofMAPT-G-CSF-(GalNAc-SA-PEG) 2 

[0561] Cell proliferation assay of MAPT-G-CSF-(GalNAc-SA-PEG) 2 with NFS-60 cell line 
and Tf-1 cell line were performed. The assay was performed using protein concentration 

between 0 ng/ml to 1000 ng/ml. MAPT-G-CSF(GalNAc-SA-PEG(20K)) 2 was active in this 
assay. 

12.4 Experimental Details 

12.4a General procedure of GalNAcsylation of Mutant G-CSF 
[0562] Certain volume of mutant G-CSF solution (for 100 ug protein) was buffer exchanged 
with MES buffer ( 25 mM + 0.005% NaN 3 , pH 6.0 ). The final volume was adjusted to 100 
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ug/100 ul. To this solution was added 5 ul 100 niM MnCl 2 and GalNAc-T 2 ( 1 mU). The 
resulting mixture was rocked at rt for a period of time required for MALDI or QTOF 
analysis. 

12.4b Preparation ofMAPTP-G-CSF-fGalNAc)^ 

5 [0563] MAPTP-G-CSF 5.4 mg (KJ-675-1 59, 0. 1 8 mg/ml, 0.053 umol) was exchanged with 

MES buffer ( 25 mM + 0.005% NaN 3 , pH 6.0). The final volume was adjusted to 5.4 ml. To 

this solution, UDP-GalNAc (5 mg, 0.15 umol), 100 mM MnCl 2 0.25 ml and GalNAc-T 2 

(l.OU/ml, 50 ul ) were added. The resulting mixture was rocked at 32°C for 24h. M + 

(MALDI): 19364 (MAPT-G-CSF-(GalNAc) 2 verse 18951 (MVPTP-G-CSF). 

0 12.4c General procedure of glycoPEGylation of mutant G-CSF sequences by one-pot 

reaction) 

[0564] Mutant G-CSF 100 ug( Mutant G-CSF- 1,2,3,4,5) was mixted with UDP-GalNAc ( 0.6 
mg, 0.923 umol), GalNAc-T 2 ( 20 ul, 8 mU), UDP-Gal( 0.6 mg, 0.923 umol), Core 1 Gal T( 
20 ul, 10 mU), CMP-SA-PEG(20K) (1 mg, 0.05 umol), St3GalII( 20 ul, 28 mU), 100 mM 
> MnC12 3 ul in 100 ul 25 mM MES buffer( pH 6.0+ 0.005% NaN 3 ). The resulting mixture 
was rocked at rt for 24h. GlycoPEGylation was followed by SDS-PAGE. 

12. 4d Comparison of mutant G-CSF-1 glycoPEGyaltion(20KDa) in various buffer 
system 

[0565] GalNAc 2 -MATP-G-CSF ( 54 ug ) was buffer exchanged to the following four buffer 
system(l. 1 M MES buffer(pH 6.0); 2. 25 mM MES buffer(pH 6.0); 3. 50 mM Bis-Tris 
buffer (pH 6.5); 4. 1M HEPS buffer (pH 7.4). Then CMP-SA-PEG (20K) ( 216 ug ) 
ST6GalNAcI( BV, lU/mL, 2.5 ul), 100 mM MnCl 2 2.5 ul were added. The resulting mixture 
was rocked at rt for 24 h. SDS-PAGE gel was used to follow the reaction. 

12. 4e Comparision of GlycoPEGylation ofMAPT-G-CSF by using ST 6 GalNAc j and 
O-sialyltransferase (Wang787-29 and 787-40) 

12. 4el Using St6GalNAc I 
[0566] First step: 30 ml KJ-675-159 solution ( 0.18 mg/ml, 5.4 mg protein in total ) was 
concentrated by ultrifiltration (MWCO 5K) at 3500 g, and then buffer exchanged with 25 
mM MES buffer( pH 6.0). Final volume was adjusted to 5.4 ml in a plastic tube. GalNAc-T 2 
(LOU /ml, 50 ul) was added, followed by addition of 0.25 mL MnCl 2 . The resulting mixture 
was rocked at 32°C for 24 h. MALDI suggested that the reaction went to completion. The 
reaction mixture was concentrated by UF(MWCO 5K) and diluted with 25 mM MES buffer 
to 5 ml, then CMP-SA-PEG(20K) ( 2*25mg), ST 6 GalNA Cl ( BV, lU/ml) , 100 mM MnCl 2 
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0.25 ml were added. The resulting mixture was rocked at 32°C overnight. SDS-PAGE was 
used for the reaction. 

12. 4e2 Using Osilyltransferase(St3Galu) : 
[0567] 200 ug GalNAc 2 -MATP-G-CSF in 200 ul 25 mM MES buffer (pH 6.0) was mixed 
with UDP-Gal 0.6 mg and core GalT (0.2U/ml, 10 ul) and 10 ul 100 mM MnCl 2 . The 
resulting mixture was rocked at 32°C for 24h. The reaction mixture was concentrated by UF 
(MWCO 5K) and diluted with 25 mM MES buffer to 200 ul. CMP-SA-PEG (800 ug ), 
St 3 GalII (l.OU/ml, 10 ul ) , 10 ul 100 mM MnCl 2 were added. The resulting mixture was 
rocked at rt for 24 h. The resulting mixture was rocked at 32°C overnight. SDS-PAGE gel 
was used to follow the reaction. 

1 2. 4f MAPTP-G-CSF-(GalNAc-SA-PEG(20K) 2 Jrom glycoPEGylation of MAPT-G- 
CSF-(GalNAc) 2 (Wang 787-42) 

[0568] MAPTP-G-CSF solution (540 ug) was concentrated and exchanged with 1M MES 
buffer (pH 6.0) and adjusted to 50 ul. Then UDP-GalNAc (100 ug, 0.15 umol, 5 eq), 
GalNAcT 2 (5.0 U/ml, 5 ul) and 100 mM MnCl 2 (5 ul ) was added. The resulting mixture was 
rocked at RT overnight. Then CMP-SA-PEG (20K) (2.16 mg, 0.108 umol) and St 6 GalNAcI 
(1.0 U/ml, 50 ul) were added. The solution was rocked at rt for 60h.. Additional CMP-SA- 
PEG(20K) (2.16 mg , 0.108 umol) and St6GalNAcI (l.OU/ml, 50 ul) were added, followed by 
slow rotation at rt for 24 h. Reaction mixture was exchanged with buffer A (25 mM NaOAc, 
0.005% polysorbate 80, pH 4.5), then purified on an Amersham SP-FF (5 mL) column with 
an isocratic elution of 100% A for 10 minutes followed by a linear gradient of 100% A to 20 
% B over 20 minutes at a flow rate of 3 mL min 1 , where B = 25 mM NaOAc, 2 M NaCl 
0.005% polysorbate 80, pH 4.5. The peak at retention time 17 mins was pooled and 
concentrated to 0.5 ml, which was further purified on an Amersham HiLoad Superdex 200 
(16 x 600 mm, 34 urn) with phosphate buffered saline, pH 5.0, 0.005% Tween80, at a flow 
rate of 0.4 mL min 1 . Product fractions at retention time 160 mins was pooled, concentrated to 
provide 30 ug of MAPT-G-CSF(GalNAc-SA-PEG(20K)) 2 ( BCA). The yield was not 
optimized. 

12.4 g Sequences of G-CSF mutants 

Mutant G-CSF-1: 

MAPTPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSL 
GIPWAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVAD 
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FATTIWQQMEELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVL 
RHLAQP (SEQ ID NO: 9) 

Mutant G-CSF-2: 

MTPLGPASSLPQSFLLKCLEQVPvKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIP 

WAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFAT 

TIWQQMEELGMAPATQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHL 
AQP (SEQ ID NO: ) 

Mutant G-CSF-3: 

MTPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIP 

WAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFAT 

TIWQQMEELGMAPALQPTQTAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHL 
AQP (SEQ ID NO: ) 

Mutant G-CSF-4 rC-terminal tag): 

MTPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIP 

WAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFAT 

TIWQQMEELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHL 
AQPTQGAMP (SEQ ID NO:8 ) 

Mutant G-CSF-5 ( N-terminal MTATP); 

MIATPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLGATYKLCHPEELVLLGHSLG 

IPWAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADF 

ATTIWQQMEELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLR 
HLAQP (SEQ ID NO:10 ) 

Mutant G-CSF-6 Y 177 Mer); 

MTPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLVSECATYKLCHPEELVLLGHS 

LGIPWAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVA 

DFATTIWQQMEELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRV 
LRHLAQP (SEQ ID NO:l) 



154 



WO 2005/070138 PCT/US2005/000799 

Human recombinant G-CSF expressed in E coli: 



MTPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIP 

WAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDYADFAT 

TIWQQMEELGMAPALQPT 134 QGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLR 
HLAQP (SEQ ID NO:2) 

EXAMPLE 13 

[0569] The following Example illustrates preparation of a GlycoPEGylated hGH protein The 
wild-type hGH has no natural glycosylation site, therefore a de novo O-glycosylation site was 
engineered into a mutant hGH protein which was then be glycosylated with a GalNAc 
transferase and sialylPEGylated at the mutant site. Five mutant hGH proteins were designed 
to incorporate an O-glycosylation site at either the amino terminus or in the loop region of the 
protein molecule. The five mutant proteins were produced and each was tested for hGH 
activity in a Nb2-1 1 cell proliferation assay. 

13.1 Mutant h GH Amino Acid Sequences: 

192 amino acid Wild-type pituitary derived hGH comprising an N- 
Terminal methionine 

MFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNP 
QTSLCFSESIPTPSNREETQQKSNLELLPJSLLLIQSWLEPVQFLRSVFANSLVYGASDS 

NVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSPCFDTNSHNDDALLKNYGLLYC 
FRKDMDKVETFLRIVQCRSVEGSCGF (SEQ ID NO:) 

191 amino acid Wil d-type pituitary derived hGH lacking an N-Terminal 

methionine 

i 

FPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNPQ 

TSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSN 

VYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCF 
RKDMDKVETFLRIVQCRSVEGSCGF (SEQ ID NO:) 
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MVTP mutant: 

(M)VTPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFL 
QNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGA 
SDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGL 
5 LYCFRKDMDKVETFLRIVQCRSVEGSCGF (SEQ ID NO: ) 

PTOGAMP mutant: 

MFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNP 
QTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDS 
NVYDLLKDLEEGIOTLMGR LEDGSPTOGAMPK OTYSKFDTNSHNDDALLKNYGLLY 
10 CFRKDMDKVETFLRTVQCRSVEGSCGF (SEQ ID NO: ) 

TTT mutant: 

MFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNP 
QTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDS 
NVYDLLKDLEEGIOTLMG RLEDGSPTTTOIFK OTYSKFDTNSHNDDALLKNYGLLYC 
15 FRKDMDKVETFLRIVQCRSVEGSCGF (SEQ ID NO: ) 

MAPT mutant: 

MAPTSSPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSF 
LQNPQTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYG 
ASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYG 
20 LLYCFRKDMDKVETFLRIVQCRSVEGSCGF (SEQ ID NO: ) 

NTG mutant: 

MFPTIPLSRLFDNAMLRAHRLHQLAFDTYQEFEEAYIPKEQKYSFLQNP 
QTSLCFSESIPTPSNREETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDS 
25 NVYDLLKDLEEGIQTLMGRLEDGSPNTGQIFKQTYSKFDTNSHNDDALLKNYGLLY 
CFRKDMDKVETFLRIVQCRSVEGSCGF 

[0570] The four hGH mutants were tested for the ability to act as substrates for 
glycosyltransferase GalNAcT2. Of the four hGH mutants, two were found to be glycosylated 
by GalNAcT2 by MALDI-MS analysis. 
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13.2 Preparation ofhGH-(TTT)-GalNAc-SA-PEG-30KDa. 

[0571] For the TTT mutant, GalNAc addition gave rise to a complex mixture of 
unglycosylated, and 1 -GalNAc and 2-GalNAc species. Peptide mapping experiments 
(trypsin digest) showed that the two GalNAc's were added to the T12 peptide (L129-K141) 
5 containing the TTT mutation. The (M)VTP mutant showed only a trace of GalNAc added by 
MALDI-MS. 

[0572] The hGH-TTT-mutant (4.0 mL, 6.0 mg, 0.27 micromoles) was buffer exchanged 
twice with 15 mL of Washing Buffer (20 mM HEPES, 150 mM NaCl, 0.02% NaN 3 , pH 7.4) 
and once with Reaction Buffer (20 mM HEPES, 150 mM NaCl, 5 mM MnCl 2 , 5 mM MgCl 2 , 
10 0.02% NaN 3 , pH 7.4) then concentrated to 2.0 mL using a Centricon centrifugal filter, 5 KDa 
MWCO. 

[0573] The hGH-TTT mutant was combined with UDP-GalNAc (1.38 micromoles, 0.90 
mg) and GalNAc-T2 (0.12 mL, 120 mU). The reaction was incubated at 32°C with gentle 
shaking for 19 hours. The reaction was analyzed by MALDI-MS and partial addition of 

1 5 GalNAc to the hGH-TTT mutant was observed (approximately 40%). CMP-SA-PEG-30K 
(16 mg, 0.533 micromoles) and ST6GalNAcl (0.375 mL, 375 mU) were added to the 
reaction mixture to bring the total volume to 2.85 mL. The reaction was incubated at 32°C 
with gentle shaking for 22 h. The reaction was monitored by SDS PAGE at 0 h and 22 h. The 
extent of reaction was determined by SDS-PAGE gel. The product, hGH-(TTT)-GalNAc-SA- 

20 PEG-30 KDa, was purified using SP Sepharose and analyzed by SDS-PAGE. Very low yield 
of the desired hGH-(TTT)-GalNAc-SA-PEG-30 KDa was observed. 

13.3 Preparation of h GH-(PTQGAMP)-GalNAc-SA-PEG-30KDa. 

[0574] The PTQGAMP mutant was was readily glycosylated with UDP-GalNAc and 
GalNAc T2, then GlycoPEGylated using CMP-SA-PEG-30KDa and ST6GalNAcl on 10 mg 
25 scale to yield 1 .45 mg of purified hGH-(PTQGAMP)-GalNAc-SA-PEG-30KDa. Peptide 

mapping experiments (trypsin digest) located the GalNAc on the trypsin T12 peptide (L129- 
K141) containing the PTQGAMP mutation. 

[0575] The hGH-PTQGAMP-mutant (4.55 mL, 10.0 mg, 0.45 micromoles) was buffer 
exchanged twice with 15 mL of Washing Buffer (20 mM HEPES, 150 mM NaCl, 0.02% 
30 NaN 3 , pH 7.4) and once with Reaction Buffer (20 mM HEPES, 1 50 mM NaCl, 5 mM MnCl 2 , 
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5 mM MgCl 2 , 0.02% NaN 3 , pH 7.4) then concentrated to 3 mL using a Centricon centrifugal 
filter, 5 KDa MWCO. 

[0576] The hGH-PTQGAMP mutant was combined with UDP-GalNAc (2.26 micromoles, 
1.47 mg) and GalNAc-T2 (0.1 mL, 100 mU). The reaction was incubated at 32°C with gentle 
5 shaking for 22 hours. The reaction was analyzed by MALDI-MS and complete addition of 
GalNAc to the hGH-PTQGAMP mutant was observed. CMP-S A-PEG-3 OK (27 mg, 0.9 
micromoles) and ST6GalNAcl (0.350 mL, 350 mU) were added to the reaction mixture to 
bring the total volume to 3.4 mL. The reaction was incubated at 32°C with gentle shaking for 
24 h. The reaction was monitored by SDS PAGE at 0 hours and 16.5 hours. The extent of 
1 0 reaction was determined by SDS-PAGE gel. The product, hGH-(PTQGAMP)-GalNAc-SA- 
PEG-30 KDa, was purified using SP Sepharoseand SEC (Superdex 200) chromato gr aphy and 
then formulated. The final product was analyzed by MALDI, peptide map and SDS-PAGE 
(silver stain). Protein was determined by BCA vs. BSA standard. The overall isolated yield 
(1.45 mg) was 12.5 % based on protein. 

15 EXAMPLE 14 

[0577] This example sets forth the preparation of a GM-CSF PEG glycoconjugate of the 
invention. 

14 J Preparation of(PEG(20K)-SA-Gal-GalNAc)2-GM-CSF and PEG(20k)-SA-Gal- 
GaWAc-GM-CSF 

20 [0578] GM-CSF (1 mg) was dissolved in 25 mM MES buffer (1 mL) (pH 6.0, 0.005% 
NaN 3 ), then UDP-GalNAc (1 mg), GalNAc-T 2 (200 |iL, 0.38 U/mL, 0.076 U), 100 mM 
MnCl 2 (80 jllL) were added. The resulting mixture was incubated at room temperature for 72 
h. MALDI indicated GalNAc2-GM-CSF was formed. 

[0579] UDP-Gal (6 mg, 9.8 mmol ), core-l-Gal-Ti (0.5 U/mL, 80 pL), CMP-SA-PEG (20 
25 kilodalton) (6 mg, 0.3 pmol), a-(0)-sialyltransferase (1 U/mL, 120 pL), 100 mM MnCl 2 (50 
jaL) were added. The resulting mixture was slowly rotated at 32° C for 48 h. The reaction 
mixture was centrifuged at 2 rpm for 5 min. The protein solution was taken. The remain 
resin was mixed with 1 mL 25 mM MES buffer (pH 6.0) and vibrated for 30 sec. The 
suspension was concentrated in again; the protein solutions were combined and concentrated 
30 to 200 mcL. HPLC Purification provided glyco-PEG-ylated GM-CSF. 
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EXAMPLE 15 

[0580] An O-linked glycosylation site similar to that of interferon alpha-2 can be 
incorporated into any interferon alpha protein at the same relative position. This can be 
performed by aligning the amino acid sequence of interest with the IFN-alpha-2b sequence 
(10-20 amino acids long) and modifying the amino acid sequence to incorporate the 
glycosylation site. Mutation with any amino acid, deletion or insertion can be used to create 
the site. Exemplary mutants maintain as high an homology as possible with the IFN-alpha-2 
sequence in this region with an emphasis on the T at position 106 (shown below in bold). An 
example of how this is performed is shown below. 

Alignments of Interferon alpha's in the NCBI Protein Database 
GI# AA# AA Sequence Name 

IFN-a-2p 1 CVIQGVGVTETPLMKEDSIL 20 (SEQ ID NO:X) 



(a,b, c) 



124449 


98 
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IFN-alpha 
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1/13 

[0581] Glycosylation/Glyco-PEG-ylation occurs at T 106 (IFN-alpha-2). Protein numbering 
begins with the first amino acid after removal of the protein leader sequence of the naturally 
expressed pre-pro form. 

[0582] Interferon alpha mutations to introduce O-Linked Glycosylation Sites in IFN- 
alpha' s that lack this site. 
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GI# 



AA# AA Sequence 



Name 



IFN-a-2P 1 CVIQGVGVTETPLMKEDSIL 20 (SEQ ID NO:X) 

124449 98 117 IFN-alpha 2 



a,b, c) 



20178265 99 E...T N 118 IFN-alpha 14 



E 107 T) 



20178265 99 . ...G...T N 118 IFN-alpha 14 



E 103 G; E 107 T) 



124453 99 ....E...T N 118 IFN-alpha 10 



E 107 T) 



124453 99 ....G...T N 118 IFN-alpha 10 



E 103 G; E 107 T) 



585316 99 



E 107 T) 



585316 99 



ME 107 VT ) 



585316 99 



E i03 G; E i07 T) 



E 107 T) 



E 103 G; E 107 T) 



E 107 T) 



E 103 G; E 107 T) 



I 107 T) 



E i03 G; x xoi T) 



E..MT N 118 IFN-alpha 17 



E..VT N 118 IFN-alpha 17 



G..MT N 118 IFN-alpha 17 



124442 99 ....E...T N..F.. 118 IFN-alpha 7 



124442 99 ....G...T N..F.. 118 IFN-alpha 7 



124438 99 ....E...T NV. . . . 118 IFN-alpha 4 



124438 99 . ...G. ..T NV. . . . 118 IFN-alpha 4 



417188 99 . .M.E. . .T.S. . . Y 118 IFN-alpha 8 



417188 99 . .M.G. . .T.S. . .Y 118 IFN-alpha 8 



20178289 99 ....E...T NV. . . . 118 IFN-alpha 21 



E 107 T) 
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20178289 99 G...T NV 118 IFN-alpha 21 

(E 103 G; E 107 T) 

124457 99 .MM.E...TD NV 118 IFN-alpha 5 

(E 107 T) 

5 124457 99 .MM.E...TB NV 118 IFN-alpha 5 

(ED 108 TE) 

124457 99 .MM. 6. . . TD NV 118 IFN-alpha 5 

(E 103 G; E 107 T) 

124463 99 . . T . E . . . T . IP. . N 118 IFN-alpha 16 

10 (E 107 T;A 110 P) 

124463 99 . . T . E . . . T . TP. . N 118 IFN-alpha 16 

(E 107 T; IA n° TP ) 

124463 99 . . T . G. . . T . TP. . N 118 IFN-alpha 16 

(E 103 G;E 107 T; IA 110 TP) 

15 124460 99 ..M.E.W.TG N 118 IFN-alpha 6 

(G 107 T) 

124460 99 . .M.E.G.TG. . . .N 118 IFN-alpha 6 

(W 105 G;G 107 T) 

124460 99 . .M.G.G.TE N 118 IFN-alpha 6 

20 (E103G;W 105 G;GG 108 TE) 

124455 99 ..M.EER.T NA. . . . 118 IFN-alpha 

1/13 (G 107 T) 

124455 99 ..M.EEG.T NA. . . . 118 IFN-alpha 

1/13 (R 105 G;G 107 T) 

25 124455 99 ..M.GVG.T NA. . . . 118 IFN-alpha 

1/13 (EER 105 GVG;G 107 T) 

The GI numbers in the above table, except the first number 124449, refer to those of the 
unmodified wild-type proteins. 

[0583] The O-linked glycosylation site can be created in any interferon alpha isoform by 
30 placing a T or S at the appropriate amino acid site as shown above. The substitution is T as 
shown in the above table. The amino acid sequences between the various interferon alpha 
forms are similar. Any amino acid mutation, insertion, deletion can be made in this region as 
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long as the T or S is at the appropriate position for glycosylation/glyco-PEG-ylation relative 
to P 109 (IFN-alpha-2) in the alignment sequence shown above. 

[0584] While this invention has been disclosed with reference to specific embodiments, it is 
apparent that other embodiments and variations of this invention may be devised by others 
5 skilled in the art without departing from the true spirit and scope of the invention. 

[0585] All patents, patent applications, and other publications cited in this application are 
incorporated by reference in the entirety. 
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1 

2 WHAT IS CLAIMED IS: 

1 1 . An isolated polypeptide comprising a mutant peptide sequence, 

2 wherein the mutant peptide sequence encodes an O-linked glycosylation site that does not 

3 exist in a wild-type polypeptide corresponding to the isolated polypeptide. 

1 2. The polypeptide of claim 1, wherein the polypeptide is a G-CSF 

2 polypeptide. 

1 3. The polypeptide of claim 2 5 wherein the G-CSF polypeptide comprises 

2 a mutant peptide sequence with the formula of M^XnTPLGP or M^oPZmXnTPLGP, and 

3 wherein 

4 the superscript denotes the position of the amino acid in the wild-type G-CSF 

5 amino acid sequence (SEQ ID NO: 3), the subscripts n and m are integers selected from 0 to 

6 3, and 

7 at least one of X and B is Thr or Ser, and 

8 when more than one of X and B is Thr or Ser, the identity of these moieties is 

9 independently selected, and 

10 Z is selected from glutamate, or any uncharged amino acid. 

1 4. The mutant G-CSF polypeptide of claim 3, wherein the mutant peptide 

2 sequence is selected from the sequences consisting of MVTPLGP, MQTPLGP, 

3 MIATPLGP), MATPLGP, MPTQGAMPLGP , MVQTPLGP, MQSTPLGP, 

4 MGQTPLGP, MAPTSSSPLGP, and MAPTPLGPA. 

1 5. The polypeptide of claim 2, wherein the G-CSF polypeptide comprises 

2 a mutant peptide sequence with the formula of M^TPXnBoOrP 

3 wherein 

4 the superscript denotes the position of the amino acid in SEQ ID NO: 3, and 

5 the subscripts n, o ? and r are integers selected from 0 to 3, and 

6 at least one of X, B and O is Thr or Ser, and 

7 when more than one of X, B and O is Thr or Ser, the identity of these moieties 

8 is independently selected. 
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1 6. The polypeptide of claim 5, wherein the mutant peptide sequence is 

2 selected from the sequences consisting of: MTPTLGP, MTPTQLGP, MTPTSLGP, 

3 MTPTQGP, MTPTSSP, M^PQTP, M^PTGP, M^PLTP, M^PNTGP, MTPLGP (G- 

4 CSF mut #4), M'TPVTP, M^PMVTP, and MT^TQGI^gVa^ 7 . 

1 7. The polypeptide of claim 2, wherein the G-CSF polypeptide comprises 

2 a mutant peptide sequence with the formula of LGX 53 B 0 LGI 

3 wherein 

4 the superscript denotes the position of the amino acid in the wild type G-CSF 

5 amino acid sequence (SEQ ID NO: 3), and 

6 X is histidine, serine, arginine, glutamic acid or tyrosine, and 

7 B is either threonine or serine, and 

8 o is an integer from 0 to 3 . 

1 8. The polypeptide of claim 7 5 wherein the mutant peptide sequence is 

2 selected from the sequences consisting of: LGHTLGI, LGSSLGI, LGYSLGI, LGESLGI, 

3 and LGSTLGI. 

1 9. The polypeptide of claim 2, wherein the G-CSF polypeptide comprises 

2 a mutant peptide sequence with the formula of P 129 Z m J q O r X n PT 

3 wherein 

4 the superscript denotes the position of the amino acid in the wild type G-CSF 

5 amino acid sequence (SEQ ID NO. 3), 

6 Z, J, O and X are independently selected from Thr or Ser, and 

7 m, q, r, and n are integers independently selected from 0 to 3.. 

1 10. The polypeptide of claim 9, wherein the mutant peptide sequence is 

2 selected from the sequences consisting of: P 129 ATQPT, P 129 TLGPT, P 129 TQGPT, 

3 P 129 TSSPT 5 P 129 TQGAPT ? P 129 NTGPT, PALQPTQT, P 129 ALTPT, P 129 MVTPT ? 

4 P 129 ASSTPT 5 P 129 TTQP 5 P 129 NTLP, P 129 TLQP, MAP 129 ATQPTQGAM, and 

5 MP 129 ATTQPTQGAM. 

1 11. The polypeptide of claim 2 ? wherein the G-CSF polypeptide comprises 

2 a mutant peptide sequence with the formula of PZ m U s JqP 61 O r X n B 0 C 

3 wherein 
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4 the superscript denotes the position of the amino acid in the wild type G-CSF 

5 amino acid sequence (SEQ ID NO. 3), 

6 at least one of Z, J, O, and U is selected from threonine or serine, and 

7 when more than one of Z, J, O and U is threonine or serine, each is 

8 independently selected, and 

9 m, s, q, r, n, and o are integers independently selected from 0 to 3. 

1 12. The polypeptide of claim 1 1, wherein the mutant peptide sequence is 

2 selected from the sequences consisting of: P 61 TSSC, P 61 TSSAC, LGIPTA P 61 LSSC, 

3 LGIPTQ P 61 LSSC, LGIPTQG P 61 LSSC, LGIPQT P 61 LSSC, LGIPTS P 61 LSSC, LGIPTS 

4 P 61 LSSC, LGIPTQP 61 LSSC, LGTPWAP 61 LSSC, LGTPFA P 61 LSSC, P 61 FTP, and 

5 SLGAP 58 TAP 61 LSS. 

1 13. The polypeptide of claim 2, wherein the G-CSF polypeptide comprises 

2 a mutant peptide sequence with the formula of 0aGpJqO r P 175 XnB o Z m U s v Ft 

3 wherein 

4 the superscript denotes the position of the amino acid in the wild type G-CSF 

5 amino acid sequence (SEQ ID NO. 3), 

6 at least one of Z, U, O, J ? G, 0, B and X is threonine or serine, and when more 

7 than one of Z, U, O, J, G, 0, B and X are threonine or serine, they are 

8 independently selected; 0 is optionally R, and G is optionally H; the symbol *P 

9 represents any uncharged amino acid residue or glutamate and 

10 a, p, q, r, n, o, m, s, and t are integers independently selected from 0 to 3.. 

1 14. The polypeptide of claim 13, wherein the mutant peptide sequence is 

2 selected from the sequences consisting of: RHLAQTP 175 , RHLAGQTP 175 , 

3 QP 175 TQGAMP, RHLAQTP 175 AM, QP 175 TSSAP, QP 175 TSSAP, QP 175 TQGAMP, 

4 QP 175 TQGAM, QP 175 TQGA, QP 175 TVM, QP 175 NTGP, and QP 175 QTLP. 

1 15. The polypeptide of claim 2, comprises a mutant peptide sequence 

2 selected from the sequences P 133 TQTAMP 139 , P 133 TQGTMP, P 133 TQGTNP, 

3 P 133 TQGTLP, and P ALQP 133 TQTAMP A. 

1 16. The polypeptide of claim 1, wherein the polypeptide is an hGH 

2 polypeptide. 
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1 17. The polypeptide of claim 1 6, wherein the mutant peptide sequence 

2 comprises a sequence selected from: IV^APTSSPTIPI^SR 9 and D GSP 1 33 NTGQIFK 1 40 

1 18. The polypeptide of claim 1 5, wherein the hGH polypeptide comprises 

2 a mutant peptide sequence with a formula of P 133 JXBOZUK 140 QTYS, and 

3 wherein 

4 the superscript denotes the position of the amino acid in the wild type hGH 

5 amino acid sequence (SEQ ID NO: 20), and 

6 J is selected from threonine and arginine; 

7 X is selected from alanine, glutamine, isoleucine, and threonine; 

8 B is selected from glycine, alanine, leucine, valine, asparagine, glutamine, and 

9 threonine; 

10 O is selected from tyrosine, serine, alanine, and threonine; 

11 Z is selected from isoleucine and methionine; and 

12 U is selcted from phenylalanine and proline. 

1 19. The polypeptide of claim 18, wherein the mutant peptide sequence is 

2 selected from the group consisting of PTTGQIFK, PTTAQIFK, PTTLQIFK, 

3 PTTLYVFK, PTTVQIFK, PTTVSIFK, PTTNQIFK, PTTQQIFK, PTATQIFK, 

4 PTQGQIFK, PTQGAIFK, PTQGAMFK, PTIGQIFK, PTINQIFK, PTINTIFK, 

5 PTILQIFK, PTIVQIFK, PTIQQIFK, PTIAQIFK, P 133 TTTQIFK 140 QTYS, and 

6 P 133 TQGAMPK 140 QTYS. 

1 20. The polypeptide of claim 1 5, wherein the hGH polypeptide comprises 

2 a mutant peptide sequence with a formula of P 133 RTGQIPTQBYS 

3 wherein 

4 the superscript denotes the position of the amino acid in the wild type hGH 

5 amino acid sequence (SEQ ID NO:20), and 

6 B is selected from alanine and threonine. 

1 21 . The polypeptide of claim 20, wherein the mutant peptide sequence is 

2 selected from the group consisting of PRTGQIPTQTYS and PRTGQIPTQ AYS. 

1 22. The polypeptide of claim 1 6, wherein the hGH polypeptide comprises 

2 a mutant peptide sequence with a formula of L XTBOP UTG 
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3 wherein 

4 superscripts denote the position of the amino acid in the wild-type hGH amino 

5 acid sequence; and wherein 

6 X is selected from glutamic acid, valine and alanine; 

7 B is selcted from glutamine, glutamic acid, and glycine; 

8 O is selcted from serine and threonine; and 

9 U is selected from arginine, serine, alanine and leucine 

1 23. The mutant hGH polypeptide of claim 22, wherein the mutant peptide 

2 sequence is selected from the group consisting of: LETQSP 133 RTG, LETQSP 133 STG, 

3 LETQSP I33 ATG, LETQSP 133 LTG, LETETP 133 R, LETETP 133 A, LVTQSP 133 RTG, 

4 LVTETP 133 RTG, L VTETP 1 33 ATG, and LATGSP 133 RTG. 

1 24. The polypeptide of claim 16, wherein the hGH polypeptide comprises 

2 a mutant peptide sequence with a formula of M^PTXnZmOPLSRL 

3 wherein 

4 wherein the superscript denotes the position of the amino acid in the wild type 

5 hGH amino acid sequence (SEQ ID NO: 19); and 

6 B is selected from phenylalanine, valine and alanine or a combination thereof; 

7 X is selected from glutamate, valine and proline 

8 Z is threonine; 

9 O is selected from leucine and isoleucine; and 

10 when X is proline, Z is threonine; and 

11 wherein 

12 n and m are integers selected from 0 and 2. 

1 25. The polypeptide of claim 24, wherein the mutant peptide sequence is 

2 selected from the group consisting of M ! FPTE IPLSRL, M^PTV LPLSRL, and 

3 M 1 APTPTIPLSRL. 

1 26. The polypeptide of claim 24, wherein the mutant peptide sequence k 

2 JV^VTPTIPLSRL, wherein the superscript 1, denotes the first position amino acid in the 

3 wild type hGH amino acid sequence (SEQ ID NO: 19) 

1 27. The polypeptide of claim 1 5, wherein the mutant peptide sequence is 

2 selected from the group consisting of: LEDGSPTTGQIFKQTYS, 
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3 LED GSPTT AQIFKQT YS , LEDGSPTATQIFKQTYS, LEDGSPTQGAMFKQTYS, 

4 LEDGSPTQGAIFKQTYS, LEDGSPTQGQIFKQTYS, LEDGSPTTLYVFKQTYS, 

5 LEDGSPTINTIFKQTYS ? LEDGSPTTVSIFKQTYS, LEDGSPRTGQIPTQTYS, 

6 LEDGSPRTGQIPTQAYS, LEDGSPTTLQIFKQTYS, LETETPRTGQIFKQTYS, 

7 LVTETPRTGQIFKQTYS, LETQ SPRTGQIFKQT YS 5 LVTQSPRTGQIFKQTYS, 

8 LVTETPATGQIFKQTYS, LEDGSPTQGAMPKQTYS, and LEDGSPTTTQIFKQT YS . 

1 28. The polypeptide of claim 1, wherein the polypeptide is an IFN alpha 

2 polypeptide. 

1 29. The polypeptide of claim 28, wherein wherein the INF alpha 

2 polypeptide has a peptide sequence comprising a mutant amino acid sequence, and the 

3 peptide sequence corresponds to a region of INF alpha 2 having a sequence as shown in 

4 SEQ NO:22 5 and wherein the mutant amino acid sequence contains a mutation to a 

5 threonine or serine amino acid at a position corresponding to T of INF alpha 2. 

1 30. The polypeptide of claim 29, wherein the IFN alpha polypeptide is 

2 selected from the group consisting of IFN alpha, IFN alpha 4, IFN alpha 5, IFN alpha 6, 

3 IFN alpha 7, IFN alpha 8, IFN alpha 10, IFN alpha 14, IFN alpha 16, IFN alpha 17, and 

4 IFN alpha 21. 

1 31. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha polypeptide comprising a mutant amino acid sequence selected from the group 

3 consisting of: 

4 "CVMQEERVTETPLMNADSIL 1 18 , "CVMQEEGVTETPLMNADSIL 1 1S , 

5 and "CVMQGVGVTETPLMNADSIL 1 18 . 

1 32. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 4 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVIQEVGVTETPLMNVDSIL 118 , and "CVIQGVGVTETPLMKEDSIL 118 . 

1 33 . The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 5 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 
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4 "CMMQEVGVTDTPLMNVDSIL 1 18 , "CMMQEVGVTETPLMNVDSIL 1 18 

5 and "CMMQGVGVTDTPLMNVDSIL 118 . 

1 34. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 6 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 99 CVMQEVWVTGTPLMNEDSIL 118 , 99 CVMQEVGVTGTPLMNEDSIL 1 18 3 

5 and "CVMQGVGVTETPLMNEDSIL 118 

1 35. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 7 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVIQEVGVTETPLMNEDFIL 1 18 ? and "CVIQGVGVTETPLMNEDFIL 118 . 

1 36. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 8 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVMQEVGVTESPLMYEDSIL U8 5 and "CVMQGVGVTESPLMYEDSIL 118 . 

1 37. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 10 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVIQEVGVTETPLMNEDSIL 118 , and "CVIQGVGVTETPLMNEDSIL 118 . 

1 38. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 14 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVIQEVGVTETPLMNEDSIL 1 18 ? and "CVIQGVGVTETPLMNEDSIL 118 . 

1 39. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 1 6 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVTQEVGVTEIPLMNEDSIL 1 18 , "CVTQEVGVTETPLMNEDSIL 1 18 5 and 

5 "CVTQGVGVTETPLMNEDSIL 1 1 8 . 
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1 40. The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 17 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVIQEVGMTETPLMNEDSIL 118 , 99 CVIQEVGVTETPLMNEDSIL 118 , and 

5 "CVIQGVGMTETPLMNEDSIL 1 18 . 

1 41 . The polypeptide of claim 30, wherein the IFN alpha polypeptide is an 

2 IFN alpha 2 1 polypeptide comprising a mutant amino acid sequence selected from the 

3 group consisting of: 

4 "CVIQEVGVTETPLMNVDSIL 118 , and "CVIQGVGVTETPLMNVDSIL 118 . 
1 42. An isolated nucleic acid encoding the polypeptide of claim 1. 

1 43. An expression cassette comprising the nucleic acid of claim 42. 

1 44. A cell comprising the nucleic acid of claim 42. 

1 45. The polypeptide of claim 1, having a formula selected from: 

AA — O — GalNAc — X ; and AA— O— GalNAc — X 

3 wherein AA is an amino acid a side chain that comprises a hydroxyl moiety 

4 that is within the mutant peptide sequence; and X a modifying group or a saccharyl moiety. 

1 46. The polypeptide according to claim 45, wherein X comprises a group 

2 selected from sialyl, galactosyl and Gal-Sia moieties, wherein at least one of said sialyl, 

3 galactosyl and Gal-Sia comprises a modifying group. 

1 47. The polypeptide according to claim 45, wherein X comprises the 

2 moiety: 
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OH 




3 . OH 

4 wherein 

5 D is a member selected from -OH and R^L-HN-; 

6 G is a member selected from R^L- and -C(0)(Ci-C6)alkyl; 

7 R 1 is a moiety comprising a member selected a moiety comprising a straight- 

8 chain or branched poly(ethylene glycol) residue; and 

9 L is a linker which is a member selected from a bond, substituted or 

1 0 unsubstituted alkyl and substituted or unsubstituted heteroalkyl, 

1 1 such that when D is OH, G is R 1 -!,-, and when G is -C(0)(Ci-C 6 )alkyl 5 D is 

12 R^L-NH-. 

1 48. The polypeptide according to claim 45, wherein X comprises the 

2 structure: 



3 




4 in which L is a substituted or unsubstituted alkyl or substituted or unsubstituted 

5 heteroalkyl group; and n is selected from the integers from 0 to about 500. 

1 49. The polypeptide according to claim 45, wherein X comprises the 

2 structure: 
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3 




4 wherein s is selected from the integers from 0 to 20. 

1 50. A method for making a glycoconjugate of the polypeptide of claim 1, 

2 comprising the steps of: 

3 (a) recombinantly producing the polypeptide, and 

4 (b) enzymatically glycosylating the polypeptide with a modified sugar at 

5 said O-linked glycosylation site. 

1 5 1 . A pharmaceutical composition of a granulocyte colony stimulating 

2 factor (G-CSF) comprising: an effective amount of the polypeptide of claim 2, wherein 

3 said polypeptide is glycoconjugated with a modified sugar. 

1 52. The pharmaceutical composition according to claim 51, wherein said 

2 modified sugar is modified with a member selected from poly(ethylene glycol) and 

3 methoxy-poly(ethylene glycol) (m-PEG). 

1 53 . A pharmaceutical composition of human Growth Hormone (hGH) 

2 comprising an effective amount of the polypeptide of claim 1 6, wherein said polypeptide 

3 is glycoconjugated with a modified sugar. 

1 54. The pharmaceutical composition according to claim 53, wherein said 

2 modified sugar is modified with a member selected from poly(ethylene glycol) and 

3 methoxy-poly (ethylene glycol) (m-PEG). 

1 55. A pharmaceutical composition of a granulocyte macrophage colony 

2 stimulating factor (GM-CSF) comprising an effective amount of GM-CSF polypeptide 

3 comprising a mutant peptide sequence, wherein the mutant sequence comprises an O- 

4 linked glycosylation site that does not exist in a wild-type GM-CSF polypeptide, and 

5 wherein said polypeptidepeptide is glycoconjugated with a modified sugar. 
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1 56. The pharmaceutical composition according to claim 55, wherein said 

2 modified sugar is modified with a member selected from poly(ethylene glycol) and 

3 methoxy-poly (ethylene glycol) (m-PEG). 

1 57. A pharmaceutical composition of an interferon alpha-2b comprising an 

2 effective amount of the polypeptide of claim 28, wherein said polypeptide is 

3 glycoconjugated with a modified sugar. 

1 58. The pharmaceutical composition according to claim 57, wherein said 

2 modified sugar is modified with a member selected from poly(ethylene glycol) and 

3 methoxy-poly(ethylene glycol) (m-PEG). 

1 59. A method of providing G-CSF therapy to a subject in need of said 

2 therapy, said method comprising, administering to said subject an effective amount the 

3 pharmaceutical composition of claim 51. 

1 60. A method of providing granulocyte macrophage colony stimulating 

2 factor therapy to a subject in need of said therapy, said method comprising: 

3 administering to said subject an effective amount the pharmaceutical 

4 composition of claim 55. 

1 6 1 . A method of providing interferon therapy to a subject in need of said 

2 therapy, said method comprising: 

3 administering to said subject an effective amount the pharmaceutical 

4 composition of claim 57. 

1 62. A method of providing Growth Hormone therapy to a subject in need 

2 of said therapy, said method comprising: 

3 administering to said subject an effective amount the pharmaceutical 

4 composition of claim 53 . 
1 
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tt-2,3-sialyltransferase 
:ST3GAL-IV) 



Bos taurus 



n.d. AJ584673 CAE48298.1 




ct-2,6-siaiyltransferase 



Bos taurus 



n.d. 



AJ620651 CAF05850.1 




oc-2,8-sialy!transferase 
(Siat8D) 



Bos taurus 



n.d. 



AJ699421 CAG27883.1 




CMP cc-2,6- 

sialyltransferase (ST6Gai 



Bos taurus 



2.4.99.1 Y15111 CAA75385.1 
NM 17751 7 NP 803483.1 



018974 




sialyltransferase ST3GaI- 
II (Siat4B) 



Bos taurus 



AJ748841 CAG44450.1 




sialyltransferase ST3GaI- 
VI (SiatIO) 



Bos taurus 



iiBlil 

AJ748843 CAG44452.1 




St6GaINAc-VI 



Bos taurus 



n.d. 



AJ620949 CAF06586.1 




polysialyltransferase 
(PST) (fragment) ST8Sia 
IV 



Cercopithecus 
aethiops 



2.4.99.- AF21 0729 AAF1 71 05.1 Q9TT09 




a-2,3-sialyltransferase 
ST3Gal I (Siat4) 



Ciona intestinalis 



n.d. 



AJ626815CAF25173.1 




«X-2,8- 
polysialyltransferase 
ST8Sia IV 



Cricetulus griseus 2,4.99- 



-AAE28634 
Z46801 CAA86822.1 



Q64690 




GalP-1,3/4-GlcNAca- 
2,3-sialyItransferase 



Cricetulus griseus n.d. AY266676 AAP22943.1 Q80WK9 




cc-2,3-sialyltransferase 



Danio rerio 



AJ783741 CAH04018.1 




a-2,3-sialyltransferase 
ST3Gal IV (Siat4c) 



Danio rerio 



n.d. 



AJ744809 CAG32845.1 
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cc-2,6-sialyltransferase 
ST6Gai 



Danio rerio 



n.d. 



AJ744801 CAG32837.1 




CC-2,6-siaIyltransferase 
ST6GalNAc V (Siat7E) 



Danio rerio 



n.d. 



AJ646874 CAG26703.1 




a-2,8-sialyltransferase 
ST8Sia I (Siat 8A) 
(fragment) 




00-2,8-sialyltransferase 
ST8Sia IV (Siat 8D) 
(fragment) 




oc-2,8-sialyltransferase 
ST8Sia VI (Siat8F) 
(fragment) 



Danio rerio 




A/-glycan a-2,8- 
sialyitransferase 



Danio rerio 



BC050483 AAH50483.1 Q7ZU51 
AY055462 AAL1 7875.1 Q8QH83 
NM 153662 NP 705948.1 




oc-2,6-sialyltransferase 
(CG4871) ST6Gal I 



Drosophiia 
meianogaster 



AE003465AAF47256.1 
AF218237AAG13185.1 
AF397532AAK92126.1 
AE003465AAM70791.1 
NMJ379129 NP_523853.1 
NM 166684 NP 726474.1 




C£-2,3-siaiyltransferase 
ST3Gal ! 



Gallus gallus 



2.4.99.4 X80503CAA56666.1 Q11200 
NM 205217 NP 990548.1 




ot-2,3-sialytransferase 



Gallus gallus 



n.d. 



AJ585761 CAE51385.2 




ct-2 t 6-sialyltransferase 
ST6Gal I 



Gallus gallus 



2.4.99.1 X75558CAA53235.1 Q92182 
NM 205241 NP 990572.1 
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tt-2,6-sialyltransferase 
ST6GalNAc V (SIAT7E) 
(fragment) 



Gallus gallus 



n.d. 



AJ646877 CAG26706.1 




ct-Z.S-sialyltransferase 



Gallus gallus 



n,d. 



AJ699419CAG27881.1 




a-2,8-sialyltransferase 



Gallus gallus 



n.d. 



AJ699424 CAG27886.1 




P-galactosamide cc-2,6- 
sialyltransferase II 

[ST6Gal II) 

g^^^^ 

polysialyltransferase 
ST8Sia IV 



Gallus gallus 



n.d. 



AJ627629 CAF29497.1 



Gallus gallus ^4.99^ ' ^ % 042399" 




cc~2,3-sialyltransferase 
ST3Gal II 



2.4.99.4 U63090AAB40389.1 Q16842 
BC036777AAH36777,1 000654 
X96667CAA65447.1 
NM 006927 HP 008858.1 



pipiipil 

mm 
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a-2,3-sialyltransferase 
ST3Gal IV 



Homo sapiens 2.4.99.- 



. L23767AAA1 6460.1 
AF035249AAC14162.1 
BC01 0645 AAH1 0645.1 
AY040826AAK93790.1 
AF516602AAM66431.1 
AF516603AAM66432.1 
AF516604AAM66433.1 
AF5250J84AAM8 1378.1 
X74570 CAA52662.1 
CR456858CAG33139.1 
NM 006278 NP 006269.1 



Q11206 
060497 
Q96QQ9 
Q8N6A6 
Q8N6A7 
Q8NFD3 
Q8NFG7 




a-2,6-sia!yltransferase 
(ST6Galll;KIAA1877) 



Homo sapiens 



BC008680AAH08680.1 
AB058780 BAB47506.1 
AB059555 BAC24793.1 
AJ512141 CAD54408.1 
AX795193 CAE48260.1 
AX795193 CAE48261.1 
NM 032528 NP 115917.1 



Q86Y44 
Q81UG7 
Q96HE4 
Q96JF0 




a-2, 6-s i a ly transferase 
(ST6GalNAcV) 



BC001201 AAH01201.1 Q9BVH7 
AK056241 BAB71 127.1 
AL035409 CAB72344. 1 
AJ507292CAD45372.1 
NM 030965 NP 112227.1 




ct-2,6-sialyltransferase 
ST6Gal I 



Homo sapiens 



2.4.99.1 BC031476AAH31476.1 
BC040009AAH40009.1 
A1 7362 CAA01 327-1 
A23699CAA01686.1 
X17247CAA35111.1 
X54363CAA38246.1 
X62822CAA44634.1 
NMJ303 0 32 NP_003023.1 
NM 173216 NP 775323.1 



P15907 




a-2,8- 

polysialyltransferase 
ST8Sia IV 



Homo sapiens 



2.4.99,- L41680AAC41775.1 
BC027866AAH27866.1 
BC053657AAH53657.1 
NM 005668 NP 005659.1 



Q8N1F4 
Q92187 
Q92693 
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ST8Sia II 



U82762 AAB51 242.1 Q92470 
U33551 AAC24458.1 Q92746 
BC069584AAH69584.1 




cc-2 t 8-sialyitransferase 
ST8Sia V 



Homo sapiens 



2.4.99.- U91641 AAC51727/1 015466 
CR457037CAG33318.1 
NM 01 3305 NP 037437.1 




lactosylceramide cc-2,3- 
sialyitransferase (ST3Gal 

V) 



Homo sapiens 2.4,99.9 



AF1 05026 AAD1 4634.1 
AF119415AAF66146.1 
BC065936 AAH65936.1 
AY1 5281 5 AA01 6866.1 
AAP65066AAP65066.1 
AY359105AAQ89463.1 
AB018356 BAA33950.1 
AX876536 CAE89320.1 
NM 003896 NP 003887.2 



Q9UNP4 
094902 




/V-acetyigalactosaminide 
cc-2,6-sialyltransferase IV 
(ST6Ga!NAc IV) 



Homo sapiens 



2.4.99.- AF1 27142 AAF001 02,1 
BC036705AAH36705.1 
-AAP63349.1 
AB035172 BAA87034.1 
AK000600 BAA91281.1 
Y17461CAB44354.1 
AJ271734CAC07404.1 
AX061620CAC24981.1 
AX068265 CAC27250.1 
AX969252 CAF14360.1 
NMJ314403 NPJ355218.3 
NM 175039 NP 778204.1 



Q9H4F1 
Q9NWU6 
Q9UKU1 
Q9ULB9 
Q9Y3G3 
Q9Y3G4 




Homo sapiens 



n.d. 



AK021929 BAB1 3940.1 Q9HAA9 
AX881696CAE91353.1 
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Gal P-1 ,3/4-GlcNAc oc- 
2,3-sialyltransferase 
[ST3Gai IV) 



Mesocricetus 
auratus 



2.4.99.6 AJ245700 CAB53395.1 Q9QXF5 . 




polysialyltransferase 
(ST8Sia IV) 



Mesocricetus 
auratus 



2.4.99.- AJ245701 CAB53396.1 Q9QXF4 




a-2 f 3-sialyltransferase 
ST3Gal II 



St3gal2 Mus musculus 



2.4.99.4 BC01 5264 AAH 1 5264.1 
BC066064AAH66064.1 
AK034554 BAC28752.1 
AK034863 BAC28859.1 
AK053827 BAC35543.1 
X76989 CAA54294.1 
NMJD09179 NPJ333205.1 
NM 178048 NP 835149.1 



Q11204 
Q8BPL0 
Q8BSA0 
Q8BSE9 
Q91WH6 




a-2,3-sialyltransferase 
ST3Gal IV 



St3gal4 Mus musculus 2.4.99.4 



BC01 1 121 AAH1 1121.1 
BC050773 AAH50773.1 
D28941 BAA06068.1 
AK0O8543 BAB25732.1 
AB061305 BAB47508.1 
X95809 CAA65076.1 
NM 009178 NP 033204.2 



P97354 
Q61325 
Q91Y74 
Q921R5 
Q9CVE8 




cc-2,6-sialyltransferase St6galnac2 Mus musculus 
ST6GalNAc II 



2.4.99.- NMJD091 80 6677963 

BC01 0208 AAH 10208.1 
AB027198 BAB00637.1 
AK004613BAB23410.1 
X93999CAA63821.1 
X94000CAA63822.1 
NM 009180 NP 033206.2 



P70277 

Q9DC24 
Q9JJM5 




NM 172829 NP 766417.1 




cc-2,6-siaiyitransferase St6galnac3 Mus musculus 
ST6GalNAc 111 



n.d. 



BC058387AAH58387.1 
AK034804BAC28836.1 
Y11342 CAA72181.2 
Y11343 CAB95031.1 



Q9WUV2 
Q9JHP5 
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cc-2,8-sialyltransferase 
(GD3 synthase) ST8Sia I 



St8sia 1 Mus musculus 



L38677AAA91 869.1 
BC024821 AAH24821.1 
AK046188BAC32625.1 
AK052444 BAC34994.1 
X84235 CAA59014.1 
AJ401102 CAC20706.1 
NM 011 374 NP 035504.1 



Q64468 
Q64687 
Q8BL76 
Q8 BWfO 
Q8K1C1 
Q9EPK0 




ct-2 t 8-sialyltransferase 
ST8Sia II 



St8sia2 Mus musculus 



2.4.99.- 



X83562CAA58548.1 035696 
X99646 CAA67965.1 
X99647CAA67965.1 
X99648 CAA67965.1 
X99649 CAA67965.1 
X99650 CAA67965.1 
X99651 CAA67965.1 
NM 009181 NP 033207.1 




cc-2,8-sialyltransferase 
ST8Sia V 



St8sla 5 Mus m usculus 



2.4.99.- 



_ _ ^PM ^|^MW* 

BC034855 AAH34855.1 
AK078670 BAC37354.1 
X98014CAA66642.1 
X98014CAA66643.1 
X98014CAA66644.1 
NM_0 13666 NPJ338694.1 
NM_1 53124 NP_694764.1 
NM 177416 NP 803135.1 



P70126 
P70127 
P70128 
Q8BJW0 
Q8JZQ3 




GD1 synthase 
(ST6GalNAc V) 



St6galnac5 Mus musculus 



n.d, 



BC055737AAH55737.1 
AB030836 BAA85747.1 
AB028840 BAA89292.1 
AK034387BAC28693.1 
AK038434 BAG29997.1 
AK042683 BAC31331.1 
NM 012028 NP 036158.2 



Q8CAM7 
Q8CBX1 
Q9QYJ1 
Q9R0K6 




A/-acetylgalactosaminide St6galnac6 Mus musculus 
ot-2,6-sialyltransferase 
(ST6GalNAc VI) 



2.4.99.- 




BC036985AAH36985.1 
AB035174 BAA87036.1 
AB035123BAA95940.1 
AK030648BAC27064.1 
NM 016973 NP 058669.1 



Q8CDC3 
Q8JZW3 
Q9JM95 
Q9R0G9 



mam 



a-2,3-sialyltransferase 




Oncorhynchus 




n.d. AJ585760 CAE51 384.1 
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myk'iss 





oc-2,8- 

polysialyltransferase IV 
(ST8Sia IV) 



Oncorhynchus 
mykiss 



n.d. 



AB094402 BAC7741 1 .1 




Q7T2X5 




ct-Z^-sialyltransferase 
STSGal IV 



Oryctolagus 
cuniculus 



2.4.99.- AF1 21 967 AAF28871.1 Q9N257 




OSJNBa0043L24.2 or 
OSJNBb0002J11.9 



Oryza sativa 
(japonlca cultivar- 



n.d. 



AL731 626'CAD41 185.1 
AL662969 CAE04714.1 




tt-2,6-sialyltransferase 
ST6GaINAcV(Siat7E) 



Oryzias latlpes 




a-2,6-sialy (transferase 



Pan troglodytes 



n.d. 



AJ748740 CAG38615.1 




.cc-2,6-sialyltransferase 



Pan troglodytes 




tx.-2,6-sialyltransferase 



Pan troglodytes 




a-2,8-sia!y!transferase 8A . 



Pan troglodytes 2.4.99.8 AJ697658 CAG26896.1 




iSJBFSiSSteaSSwjSsS 

a-2,8-sia!yltransferase 
8C (Siat8C) 



Pan troglodytes 




a-2,8-sialyltransferase 8E 
(SiatgE) 



Pan troglodytes 




P-galactosamide cc-2,6- 
sialyltransferase 1 



Pan troglodytes 




GM3 f?vnthasfi RT^rtai \/ 
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a-2,3-sialyltransferase 
ST3Gal III 



Rattus norvegicus 2,4.99.6 M97754AAA42 146.1 Q02734 

NM 031 697 NP 113885.1 




oc-2 ,3-sialyltransferase 
ST3Gal VI 



Ra ttus n orvegicus n.d. 



AJ626743 CAF25053.1 




a-2,6-slalyitransferase Rattus norvegicus n.d. 

ST6GalNAc IV (Siat7D) 



AJ646871 CAG26700.1 




tt-2,6-sialyltransferase 
ST6GalNAc VI (Siat7F) 



Rattus norvegicus n.d. 



AJ646881 CAG26710.1 




<x>2,8-sialyltransferase 
[S1AT8E) 



Rattus norvegicus n.d. AJ699422 CAG27884.1 




oo-2 t 8-sialy (transferase 
ST8Sia II 

m 



Rattus norvegicus 2.4.99- 



L13445AAA42147.1 Q07977 
NM 057156 NP 476497.1 Q64688 




a>2,8-sialyltransferase 
ST8Sia IV 



Rattus norvegicus 2.4.99.- 



U9021 5 AAB49989.1 008563 




GM3 synthase ST3Gal V 



Rattus norvegicus n.d. 



AB018049 BAA33492.1 
NM 031337 NP 112627.1 



088830 




00-2,3-siaIyltransferase 



Silurana trvpicatis 



AJ585763CAE51387.1 




cc-2,6-sialy!transferase 



Strongyiocentrotus n.d, AJ699425CAG27887.1 




cc-2 , 3-sia ly Itra n sferase 



Sus scrofa 



AJ584674CAE48299.1 




a>2,6-sialyltransferase 



Sus scrofa 



2.4.99.1 AF1 36746 AAD33059.1 Q9XSG8 




sialyltransferase 
(fraamenti ST6Gai I 



sus scrofa 



AF041031 AAC15633.1 062717 



18/23 



WO 2005/070138 



PCT/US2005/000799 



FIGURE" 



9«f 



10 J 




a-2 , 3-slalyltransferase 
'Siat5 



AJ744805 CAG32841.1 




a-2,3-sia!yltransferase 
ST3Gal II (Siat5) 



Takifugu rubripes 



n.d. AJ6268 1 7 CAF25175.1 




a-2,6-sialyltransferase 



Takifugu rubripes n.d. AJ744800 CAG32836.1 




a-2,6-sia!yitransferase 
ST6GalNAc II B (Siat7B- 
related) 



Takifugu rubripes 




00-2,6-siaIyltransferase 
ST6GalNAc IV (siat7D) 
(fragment) 



Takifugurubripes 2.4.99,3 



Y1 7466 C AB44338 . 1 Q9W6U6 
AJ646869CAG26698.1 




.a-2,6-siaIyltransferase 
ST6GaINAc VI (Siat7F) 
(fragment) 



Takifugu rubripes 



AJ646880 CAG26709.1 




0t-2,8~sialy!transferase 
ST8Sia li (Slat 8B) 



Takifugu rubripes n.d. AJ71 5538 CAG29377.1 




a-2,8-sialyltransferase 
ST8Sia INrfSiat 8Cr) 



Takifugu rubripes 



AJ715542 CAG29381.1 




a-2,8-sialyItrarisferase 
ST8Sia VI (Siat8F) 
(fragment) 



Takifugu rubripes 



AJ715549 CAG29388.1 




a-2,3-sialyItransferase 
(Siat5-r) 



Tetraodon 
nigroviridis 



n.d. 



AJ744806 CAG32842.1 




a-2,3-sialyltransferase 
S' 



Tetraodon 



AJ626822CAF25180.1 




a-2, 6-sialy (transferase 
ST6GalNAc V (Siat7E) 
(fragment) 



Tetraodon 
nigroviridis 



n.d. 



AJ646879 CAG26708.1 




a-2,8-sialyltransferase 
ST8Sia II (Siat 8B) 
(fragment) 



Tetraodon 
nigroviridis 



AJ715537CAG29376.1 
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a-2 t 8-sialyltransferase 
ST8Sia Mr (Slat 8Cr) 
[fragment 



Tetraodon 
nigroviridis 



n.d. 



AJ715540CAG29379.1 




ct-2,3-sialyltransferase 
:st3Gal-ll) 

H 



Xenopus laevis 



n.d. 



AJ585762 CAE51 386.1 




a-2 t 3-sialyltransferase 
StSGal-ili ' 



11 

Xenopus laevis 



AJ585764 CAE51 388.1 
AJ626823 CAF25181.1 




cc-2,8-sialyltransferase 
ST8SiCC-l (Siat8A;GD3 
synthase) 



Xenopus laevis 



n.d. 



AY272056AAQ16162.1 
AY272057 AAQ1 61 63. 1 
AJ704562 CAG28695.1 




a-2,3-sialyltransferase 
(3Gal r VI) 

t^fejE^cfs' - * - . r -r- ■ 



Xenopus tropicalis n,d. AJ626744 CAF25054.1 




rc-2,6-sialyltransferase 
ST6Ga!NAc V (Siat7E) 
[fragment) 



Xenopus tropicalis n.d. AJ646878 CAG26707.1 




P-galactosamide Ct-2,6- 
sialyltransferase II 



Xenopus tropicalis n.d. AJ627628 CAF29496.1 




polysialyltransferase 



Escherichia coli K92 



2.4.-.- 



M88479AAA24215.1 Q47404 




SynE 



Neisseria meningitidis 
FAM18 



n.d. 



U75650AAB53842.1 006435 




SiaD (fragment) 



Neisseria meningitidis 
M209 



n.d. AY281046AAP34769.1 




polysialyltransferase (SiaD)(fragment) Neisseria meningitidis 

M3315 



n.d. AY2341 91 AAQ85289.1 




polysialyltransferase (SiaD)(fragment) Neisseria meningitidis 

M4211 



n.d. AY2341 90 AA085288.1 




polysialyltransferase (SiaD)(fragment) Neisseria meningitidis 

M5177 



n.d. AY2341 93 AAQ85291 .1 




SiaD (fragment) 



Neisseria meningitidis 
M980 



n.d. AY281045AAP34768.1 
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ORF2 



Haemophilus 
influenzae A2 



n.d, 



M94855AAA24979.1 




ct-a^-sialyltransferase 



Neisseria 2.4.99.4 
gonorrhoeae 



U60664 AAC44539.1 P72074 
AAE67205.1 




CC-2,3-sialyltransferase 



meningitidis 
406Y, NRCC 
4030 




NMA1118 



Neisseria 
meningitidis 
Z2491 



AL162755 CAB84380.1 Q9JUV5 
NC_003116NP_283887.1 




Salmonella 
enterica 
SARB25 



n.d, AF519787AAM82550.1 Q8KS93 




WaaH 



Salmonella 
enterica 
SARB39 



n.d. AF51 9789 AAM82552.1 




WaaH 



Salmonella 
enterica 



n.d. AF519791 AAM82554.1 Q8KS91 




Salmonella 
enterica 
SARC12 




WaaH (fragment) 




Salmonella 
enterica 
SARC14I 




AF519783AAM88844.1 Q8KS97 




Salmonella 
enterica 
SARC16II 



n.d. AF519785AAM88846.1 Q8KS95 
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WaaH (fragment) 



Salmonella 
enterica 
SARC4I 



AF519773AAM88835.1 Q8KSA3 




WaaH 



Salmonella 
enterica 
SARC6lla 



n.d. AF519775AAM88837.1 Q8KSA2 




WaaH 



Salmonella 

enterica 

SARC9V 



n.d. AF519778AAM88839.1 Q8KSA0 





Cst 



Campylobacter jejuni 
81-176 



n.d, AF305571 AAL09368.1 



^^^^^^^^^^^^ 




cc-2 t 3-sialy transferase (Cst- Campylobacter jejuni 2.4.99.- AF400047AAK85419.1 

ill) ATCC 43430 

oc-2,3/8-siafyltransferase ^Campylobacter jejuni n.d. AF400048 AAK9 1725.1 Q93MQ0 

(Cstl!) ATCC 43438 



ana 




Ct-2 t 3-sialyltransferase (Cst- Campylobacter jejuni 2.4.99.- AF401528 AAL05990.1 Q93D05 




a-2,3/8-sia!yltransferase 

(Cst-n 



Campylobacter jejuni 
ATCC 700297 



n.d. AF216647AAL36462.1 




a^^-sialyltransferase cstlll Campylobacter jejuni 2.4,99.- AF1 95055 AAG29922.1 

MSC57360 




cc-2,3/&-2,8-sialyltransferase Campylobacter jejuni 
II (cstl I) C:10 



n.d. -AA096669>1 
AX934427CAF04167.1 




oo2,3/a-2 f 8-sialyltransferase Campylobacter jejuni . 
II (Cstl)) 0:36 



AX934436CAF04171.1 




tt-2,3/cc-2,8-siaiyltransferase Campylobacter jejuni 
II (Cstll) 0:4? 



-AAO96670.1 
-AAT17967.1 
AX934429CAF04168.1 




Afunctional a-2,3/-2, 8- 
sialyltransferase (Cst-ll) 



Campylobacter jejuni 2.4.99.- AF130984 AAF31771.1 
OH4384 AX934425CAF04166.1 



1R07C 
1R08A 
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1 










PM1174 



Pasteurella multocida 
PM70 




AE0061 57 AAK03258.1 Q9CLP3 
NC 002663 NP 246111.1 




Sequence 10 from patent US Unknown. 
6699706 



Mfm 

AAT1 7969.1 




Sequence 2 from patent US Unknown. 
6709834 



-AAT23232.1 




Sequence 3 from patent US Unknown. 
6699705 




Sequence 35 from patent US Unknown. 
6503744 (fragment) 



-AA096685.1 

-AAS36262.1 




Sequence 5 from patent US Unknown. 
6699705 



n.d. 



-AAT1 7966.1 
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